[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: XML 1.1 and Unicode normalization

unicode normalization checking

John Cowan wrote:
> james anderson scripsit:
> > it would be clear how to proceed if xml-11 incorporates charmod and requires
> > processors to normalize - thereby entailing normalization-checking.
> >
> > to specify that the processor must, at option, check normalization, but must
> > not transform to normal form, while the referenced specification requires that
> > all "producers of strings" ensure (not just check) that they are normalized,
> > led this implementer to put the proposals back, to wait for later versions.
> XML parsers are considered consumers, not producers. 

that is one of the less intuitively obvious things in these specs.

>   Therefore,
> they should normalization-check in accordance with CharMod.  For practical
> reasons it was decided not to make normalization-checking required.

there's this passage in charmod which goes something like "a text processing
component [an instance of which i would expect an xml processor to be] that
receives suspect text [instances of which i would, in general, expect
documents to be] must not perform any normalization-sensistive operations
[instances of which i would expect any name construction and comparison
operations to be] unless it has first confirmed through inspection that the
text is in normalized form, ...."

which renders the distinction between consumers and producers academic.
unless there some way to interpret the passage so that it does not apply to
things like start/end tag matching, attribute defaulting, and validation.

what is more, the passage continues with the proscription, that "[a text
processing component] must not normalize the suspect text." 

which left me wondering whether a parser would be conformant if, when it
signalled an exception upon determining that it was about to construct a name
from a non-nfc string, it at least offered the application a restart which
attempted to normalize the namestring and continue.



Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
First Name
Last Name
Subscribe in XML format
RSS 2.0
Atom 0.3

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.

Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.