|
[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: Some comments on the 1.1 draft
From: "John Cowan" <cowan@m...> > Rick Jelliffe scripsit: > > > Even if you only use ISO 8859-1, it is still important. The Euro=0x80 > > mistake will be increasingly common, and we need to make sure that > > XML processors continue to catch this error. > > But they don't! > > Characters U+0080 through U+009F are legal XML content. > > You are talking about a "defense" that doesn't even exist. No. 0x0085 is not AFAIK a character in ISO 8859-1 (it is one of the design principles of 8859-1 that it will not fail on systems that mask the 8th bit and look for control characters). So a document labelled as ISO 8859-1 but with an 0x85 false Euro should fail on import. The 85 character not existing in 8859-1, it never gets as far as Unicode. MSXML 4 gets this right, and gives an error at those times. I have had a support request on this with our validator, so I had to look into it. The defense does exist. Cheers Rick Jelliffe
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|
|||||||||

Cart








