[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: encoding problem fixed
James Tauber wrote: > In other words, rather than creating an InputSource using a FileReader, I > used James Clark's "fileInputSource" method in XT to make a URL out of a > file and create the InputSource from the URL string. Yes, indeed. You should never use a Reader of any sort when processing XML (unless you have a non-standard Reader class that understands the XML declaration). Always use an InputSource so that the parser can install its own bytes-to-chars converter based on the declaration. > The culprit is FileReader. It is the one doing the strange "read UTF-8 as > Windows code page". Actually, it's doing what it's expected to: reading the native charset, CP-1252. (Unix JVMs use 8859-1 instead.) It has no way of knowing that *you* think the document charset is UTF-8. -- John Cowan http://www.ccil.org/~cowan cowan@c... Schlingt dreifach einen Kreis um dies! / Schliesst euer Aug vor heiliger Schau, Denn er genoss vom Honig-Tau / Und trank die Milch vom Paradies. -- Coleridge / Politzer xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i... Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1 To (un)subscribe, mailto:majordomo@i... the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@i... the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@i...)
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|