Re: XML and special Characters : unicode v3.0 ?
At 02:09 PM 3/1/99 -0500, John Cowan wrote:
>Timothaeus Bray scripsit:
>
>> [D]id you know the BOM was legal in UTF-8?
>
>The BOM isn't just a BOM, it's also the ZWNBSP (zero-width
>non-breaking space; no, I do not know how to pronounce that
>acronym) character, and is interpreted as a BOM only at the
>beginning of UCS-2 or UTF-16 documents. Not to worry; the character is
>as near to a no-op as Unicode allows for.

I think there is reason for worry. In a UTF-16 document, you can have
a BOM and then the <?xml version=?>, and that PI will still be recognized
as the XML declaration. The spec is, I think, pretty clear, that a
ZWNBSP or any other *data* character before the XML declaration is
verboten. So... it seems that in UTF8, a ZWNBSP as first character in
the file isn't a data character. Blecch.

 -Tim

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i...
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@i... the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@i... the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@i...)
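The distinction discussed in the message above, where a leading U+FEFF is a signature and not a data character, can be sketched in Python. This is only an illustration under assumptions: the helper names `split_bom` and `starts_with_xml_decl` are hypothetical, UTF-32 is ignored, and the "strip a leading BOM in UTF-8 too" behaviour is the interpretation described in the message, not a quotation from the XML spec.

```python
import codecs

# Byte sequences that U+FEFF produces at the start of a stream.
# (UTF-32 is deliberately left out of this sketch.)
BOMS = [
    (codecs.BOM_UTF8, "utf-8"),          # EF BB BF
    (codecs.BOM_UTF16_BE, "utf-16-be"),  # FE FF
    (codecs.BOM_UTF16_LE, "utf-16-le"),  # FF FE
]


def split_bom(data: bytes):
    """Return (encoding or None, payload) with one leading BOM removed."""
    for bom, encoding in BOMS:
        if data.startswith(bom):
            return encoding, data[len(bom):]
    return None, data


def starts_with_xml_decl(data: bytes) -> bool:
    """True if '<?xml ' is the first thing after an optional leading BOM."""
    encoding, payload = split_bom(data)
    # Assumption: with no BOM, treat the bytes as UTF-8.
    text = payload.decode(encoding or "utf-8", errors="replace")
    return text.startswith("<?xml ")
```

Under this reading, `codecs.BOM_UTF8 + b"<?xml version='1.0'?><a/>"` is accepted, while the same declaration preceded by an ordinary space is not. A U+FEFF anywhere other than the very start is left in the payload, where it is just a ZWNBSP.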