Re: Blueberry/Unicode/XML

From: Rob Lugt <roblugt@e...>
To: James Clark <jjc@j...>, Tim Bray <tbray@t...>,xml-dev@l...
Date: Tue, 10 Jul 2001 09:58:35 +0100

Play the video

James Clark wrote:
> > 1. Leave it the way it is.
> > 2. Do Blueberry and then repeat the process for Unicode 3.2
> >    and 4.0 and so on every couple of years forever.
> > 3. Bite the bullet, write the rules in terms of Unicode
> >    metadata and go to a pure use-by-reference architecture,
> >    probably adding a syntactic signal to reference the
> >    Unicode version number.
>
> I don't find any of these options very appealing.
>
> Another bullet one could bite is to no longer make checking of name
> characters (beyond what is needed to prevent ambiguity) a part of
> well-formedness.  Whilst it's nice to have some sanity checking of names,
> using inappropriate characters in names doesn't cause problems for further
> processing layers to the same extent as other things that are part of
> well-formedness do, such as unbalanced tags or duplicate attributes.
>
> At least I think one should consider easing draconian error handling for
bad
> name characters to reduce deployment problems with option 2.
>

This sounds like an elegant and simple proposal.  It satisfies Tim's desire
to be able to eventually say XML is "complete" and it might actually result
in more efficient parsers if it removes the need for an XML character table.

However, I presume there was a good reason why the current name character
scheme was implemented.  The reasons I can think of are easily dismissed or
dealt with.  Are there any other more serious implications?

- Obviously certain mark-up characters must be excluded from names: [ " '
( ) < > & 0x9 0xA 0x20 ...]
- Characters appearing as whitespace could be confusing
- er any others?

~Rob

Follow-Ups:
- Re: Blueberry/Unicode/XML
  - From: "Christopher R. Maden" <crism@m...>
- Re: Blueberry/Unicode/XML
  - From: Rick Jelliffe <ricko@a...>

References:
- Presumption of XML's Stability (was RE: XML Blueberry (non-ASCII namecharacters in Japan))
  - From: Mike.Champion@S...
- Blueberry/Unicode/XML
  - From: Tim Bray <tbray@t...>
- Re: Blueberry/Unicode/XML
  - From: James Clark <jjc@j...>

Prev by Date: RE: (OT) OS other than Win
Next by Date: Re: Accepting non-deterministic content models
Previous by thread: Re: Blueberry/Unicode/XML
Next by thread: Re: Blueberry/Unicode/XML
Index(es):
- Date
- Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Subscribe in XML format

RSS 2.0
Atom 0.3

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.

Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

XML Editor - Download a 15 Day Free Trial Now >

See What's New in Stylus Studio >

Buy Stylus Studio - XML Editor - Now >