Re: Mix encodings in a document?
Tony Graham scripsit:

> Surrogate pairs are not allowed in parsed entities. The production
> for Char excludes the surrogate blocks:
>
> [2] Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD]
>            | [#x10000-#x10FFFF]

On the contrary. UTF-16 is a standard representation that XML systems
must accept (clause 4.3.3), and the representation of the characters
#x10000-#x10FFFF in UTF-16 (which is the same as Unicode 2.x) is
precisely a surrogate pair. Individual surrogate characters are
excluded, but they have no meaning in UTF-16 anyway.

> You can include non-BMP/non-UCS-2 characters by making numeric
> references to their Unicode Scalar Value (or by using UCS-4).

That works too.

--
John Cowan http://www.ccil.org/~cowan cowan@c...
You tollerday donsk? N. You tolkatiff scowegian? Nn.
You spigotty anglease? Nnn. You phonio saxo? Nnnn.
Clear all so! 'Tis a Jute.... (Finnegans Wake 16.5)

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i...
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@i... the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@i... the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@i...)
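The distinction drawn in this message can be checked with a short Python sketch. It is illustrative only: the helper names `is_xml_char` and `utf16_code_units` are made up here, and the parsing step assumes the expat-based parser in Python's standard `xml.etree.ElementTree`. It shows that a lone surrogate code point falls outside the Char production, while a character in #x10000-#x10FFFF is a legal Char whose UTF-16 form is a surrogate pair, and that the numeric-reference route gives the same character.

```python
import xml.etree.ElementTree as ET


def is_xml_char(cp: int) -> bool:
    """Production [2] Char of XML 1.0, applied to a single code point."""
    return (
        cp in (0x9, 0xA, 0xD)
        or 0x20 <= cp <= 0xD7FF
        or 0xE000 <= cp <= 0xFFFD
        or 0x10000 <= cp <= 0x10FFFF
    )


def utf16_code_units(text: str) -> list[int]:
    """Return the 16-bit code units of text when encoded as UTF-16."""
    data = text.encode("utf-16-be")
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


# U+10000 is the first character outside the BMP.
ch = "\U00010000"
units = utf16_code_units(ch)

# The character itself matches Char; each half of its UTF-16 form,
# taken alone as a code point, does not.
char_ok = is_xml_char(ord(ch))
halves_ok = [is_xml_char(u) for u in units]

# Route 1: a UTF-16 document (with byte order mark) carrying the
# character directly, i.e. as a surrogate pair on the wire.
doc_utf16 = ('<?xml version="1.0" encoding="UTF-16"?><a>' + ch + "</a>").encode("utf-16")
from_utf16 = ET.fromstring(doc_utf16).text

# Route 2: a numeric character reference to the scalar value.
from_ref = ET.fromstring("<a>&#x10000;</a>").text
```

Both routes yield the same one-character string; the surrogate pair exists only in the UTF-16 byte stream, not as two separate characters in the parsed document.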