XML-DEV Mailing List Archive
Re: encoding problem fixed
Elliotte Rusty Harold wrote:
>
> It's possible to start with an InputStream to read the XML declaration,
> then chain that InputStream to an InputStreamReader once the encoding is
> known and never use the InputStream directly again. Since the XML
> declaration is ASCII (possibly aside from a byte order mark) this isn't
> all that difficult to implement.

Make that a "PushbackInputStream" ... remember that XML (and text)
declarations are optional, and if you're not tying into the parser's
prolog logic, you'll need to feed it a stream of characters.

Consider a document starting "<tag>" where instead of "t" you've got
a multibyte UTF-8 encoded name. Without some pushback there, you're
likely in severe trouble unless you handle the UTF-8 directly ... that
"ASCII only" assumption can fail _very_ quickly.

- Dave
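
To make the pushback idea concrete, here is a minimal sketch rather than anything from a real parser. The class name EncodingSniffer, the 128-byte peek window, and the helper names are all made up for illustration. It assumes an ASCII-compatible encoding: it skips a UTF-8 byte order mark if present, looks for an encoding pseudo-attribute in an optional XML declaration, falls back to UTF-8 otherwise, and pushes every peeked byte back so the Reader sees the document from its first character. UTF-16 and UTF-32 detection, and declarations longer than the peek window, are deliberately left out.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.PushbackInputStream;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only.
final class EncodingSniffer {
    // Assumption: the XML declaration, if any, fits in this many bytes.
    private static final int PEEK = 128;

    // Matches an encoding pseudo-attribute inside a leading XML declaration.
    private static final Pattern ENCODING = Pattern.compile(
        "^<\\?xml\\s[^>]*?encoding\\s*=\\s*[\"']([A-Za-z][A-Za-z0-9._-]*)[\"']");

    static Reader openReader(InputStream raw) throws IOException {
        PushbackInputStream in = new PushbackInputStream(raw, PEEK);
        byte[] buf = new byte[PEEK];

        // Peek at up to PEEK bytes; a single read() may return fewer.
        int n = 0;
        while (n < PEEK) {
            int r = in.read(buf, n, PEEK - n);
            if (r < 0) break;
            n += r;
        }

        // Skip a UTF-8 byte order mark; InputStreamReader would not strip it.
        int skip = 0;
        if (n >= 3 && (buf[0] & 0xFF) == 0xEF
                   && (buf[1] & 0xFF) == 0xBB
                   && (buf[2] & 0xFF) == 0xBF) {
            skip = 3;
        }

        // ISO-8859-1 maps each byte to one char, so the peeked bytes can be
        // searched safely even if they contain multibyte sequences.
        String head = new String(buf, skip, n - skip, StandardCharsets.ISO_8859_1);
        Matcher m = ENCODING.matcher(head);
        String encoding = m.find() ? m.group(1) : "UTF-8";

        // Push back everything after the BOM, declaration or not, so no
        // bytes of a name like "<\u00e4g>" are lost to the sniffing step.
        in.unread(buf, skip, n - skip);
        return new InputStreamReader(in, encoding);
    }

    // Convenience wrapper: decode a whole in-memory document to a String.
    static String decode(byte[] document) {
        try (Reader reader = openReader(new ByteArrayInputStream(document))) {
            StringBuilder text = new StringBuilder();
            for (int c; (c = reader.read()) != -1; ) {
                text.append((char) c);
            }
            return text.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

With no declaration present, a document whose first element name starts with a multibyte UTF-8 character still comes through whole, because the sniffed bytes are handed back to the stream instead of being consumed.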