Re: Specifying a Unicode subset
Gustaf Liljegren scripsit:

> This way, those who need to use characters in the intervals forbidden in
> XML 1.0 would have the freedom to use them, while the rest of us are left
> unaffected.

The 65 ISO controls that are forbidden are (all but one) representable as
character references. The point of forbidding them is to improve
character-encoding detection in a world where most documents are not
Unicode-encoded. For example, because U+0080 is forbidden, a Windows-1252
document mislabeled as Latin-1 will cough on the Euro sign, because it
will incorrectly be mapped to U+0080 instead of the correct U+20AC.
This is a Good Thing.

> If I'd decide, there would be no change in XML. But if a new version is
> unavoidable and I need to pick one, I'd rather go for a more flexible
> solution, because I fear that 1.1 won't be the last version of its kind.

I believe that 1.1 will be the last release related purely to characters.
If there is an XML 2.0, it will be about entirely different issues.

-- 
Some people open all the Windows;       John Cowan
wise wives welcome the spring           jcowan@r...
by moving the Unix.                     http://www.reutershealth.com
  --ad for Unix Book Units (U.K.)       http://www.ccil.org/~cowan
        (see http://cm.bell-labs.com/cm/cs/who/dmr/unix3image.gif)
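The Euro-sign example in the message can be tried out with a short Python sketch. It is only an illustration, not anything taken from the thread: the names `RESTRICTED_RANGES`, `is_restricted` and `find_literal_controls` are hypothetical, and the ranges are assumed to match the RestrictedChar production of the final XML 1.1 Recommendation, which may differ from the draft under discussion here.

```python
# Assumed ranges: the RestrictedChar production of XML 1.1
# (controls that may not appear literally in a document).
RESTRICTED_RANGES = [
    (0x01, 0x08), (0x0B, 0x0C), (0x0E, 0x1F),  # C0 controls except TAB, LF, CR
    (0x7F, 0x84), (0x86, 0x9F),                # DEL and C1 controls except NEL
]


def is_restricted(ch: str) -> bool:
    """Return True if ch falls in one of the assumed restricted ranges."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in RESTRICTED_RANGES)


def find_literal_controls(raw: bytes, declared_encoding: str) -> list:
    """Decode raw with the declared encoding and list restricted characters.

    Returns (index, "U+XXXX") pairs; a non-empty list is a hint that the
    declared encoding may be wrong.
    """
    text = raw.decode(declared_encoding)
    return [(i, f"U+{ord(ch):04X}") for i, ch in enumerate(text) if is_restricted(ch)]


# A document that really is Windows-1252: the Euro sign is the single byte 0x80.
raw = "<price>\u20ac10</price>".encode("cp1252")

# Mislabeled as Latin-1, byte 0x80 decodes to the control U+0080 and is flagged.
print(find_literal_controls(raw, "latin-1"))

# Correctly labeled, the same byte decodes to U+20AC and nothing is flagged.
print(find_literal_controls(raw, "cp1252"))
```

A parser that rejects the literal control turns a silent mislabeling into a visible error, which seems to be the benefit described above.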