|
[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: Mix encodings in a document?
Deke Smith asked about Gavin Thomas Nicol's remark: > >Remember: byte != character code != character != glyph A character code may be more than one byte long, but is always an integer. A character is an abstract object which can be represented by different character codes in different coded character sets (ASCII, EBCDIC/US, JIS X 0208, etc.) Glyphs are abstractions of *appearance*, whereas characters are abstractions of *function*. > ISO-10646-UCS-2 > ISO-10646-UCS-4 > ISO-10646-UTF-1 > ISO-10646-Unicode-Latin1 > ISO-10646-J-1 > UNICODE-1-1 > UNICODE-1-1-UTF-7 > UTF-7 > UTF-8 ISO-10646-UCS-2 is near enough UTF-16; UTF-16 only implies that surrogates are correctly processed, and decent UCS-2 implementations will at worst leave surrogates alone. -- John Cowan http://www.ccil.org/~cowan cowan@c... You tollerday donsk? N. You tolkatiff scowegian? Nn. You spigotty anglease? Nnn. You phonio saxo? Nnnn. Clear all so! 'Tis a Jute.... (Finnegans Wake 16.5) xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i... Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@i... the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@i... the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@i...)
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|
|||||||||

Cart








