Re: Supporting Unicode (was Some comments on the 1.1 draft)
John Cowan <cowan@m...> wrote: > Rick Jelliffe scripsit: > > That makes it clear that control characters are unlike other characters, > > for which Unicode provides "semantics". The only C0 or C1 characters for > > which Unicode provides "semantics" are TAB, CR, LF and NEL. > > XML already, however, allows the use of undefined codepoints, which have > far less semantics than the C0 controls. And a good thing too, or > Ethiopic and Thaana and Canadian Aboriginal Syllabics would be totally > locked out of XML (they are post-Unicode-2.0) instead of merely > banned in XML names. Undefined codepoints have the semantic of "potential site for a future Unicode character codepoint". It seems to me unlikely that Unicode will assign any additional character semantics to the C0 and C1 blocks, making the allowance for C0 controls in XML of dubious value as a "future-proofing" measure. Cheers, -Peter S. Housel- housel@a... http://members.home.com/housel/
PURCHASE STYLUS STUDIO ONLINE TODAY!
Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!
Download The World's Best XML IDE!
Accelerate XML development with our award-winning XML IDE - Download a free trial today!
Subscribe in XML format