Re: text extraction
Andrew Welch wrote:
> On 10/12/06, mus47@xxxxxxxx <mus47@xxxxxxxx> wrote:
>> And also I want to now how can the output file encoding setted to
>> iso8859-1 instead of utf8.
>> I use the xsltproc tool.
>
> You can set the output encoding using <xsl:output/>
Are you sure? Interestingly the spec states:
"The value of the encoding attribute provides the value of the encoding parameter to the serialization method. The default value is implementation-defined, but in the case of the xml and xhtml methods it must be either UTF-8 or UTF-16."
...which took me a little by surprise. It seems to say that when the output method is xml or xhtml, the encoding MUST be either UTF-8 or UTF-16? Saxon doesn't seem to mind...
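Also note, the first 128 code points (0 to 127) are encoded identically in ISO-8859-1 and UTF-8. Only code points 128 and above are treated differently: ISO-8859-1 writes each as a single byte, while UTF-8 uses two bytes. (Byte 128 is sometimes displayed as the euro sign, which is its Windows-1252 meaning; in ISO-8859-1 proper it is a control character, so you may see something different.)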
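If you want to check the byte-level claim yourself, here is a minimal sketch in Python. It is not part of the original exchange and has nothing to do with xsltproc itself; the helper name `encodings_agree` is made up for illustration. It compares the bytes both encodings produce for code points 0 to 255.

```python
# Compare how ISO-8859-1 and UTF-8 encode the same code point.
# encodings_agree is a hypothetical helper, named here only for illustration.
def encodings_agree(code_point: int) -> bool:
    ch = chr(code_point)
    return ch.encode("iso-8859-1") == ch.encode("utf-8")

# Code points 0..127 (plain ASCII) yield the same single byte in both encodings.
ascii_range_identical = all(encodings_agree(cp) for cp in range(128))

# Code points 128..255 yield one byte in ISO-8859-1 but two bytes in UTF-8.
upper_range_differs = all(not encodings_agree(cp) for cp in range(128, 256))

# Concrete case: U+00E9 (e with acute accent).
e_acute_latin1 = chr(0xE9).encode("iso-8859-1")
e_acute_utf8 = chr(0xE9).encode("utf-8")

print(ascii_range_identical, upper_range_differs)  # True True
print(e_acute_latin1.hex(), e_acute_utf8.hex())    # e9 c3a9
```

So a document containing only ASCII characters is byte-for-byte the same in either encoding, and the choice of output encoding only becomes visible once accented or other non-ASCII characters appear.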