[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

XML-1.1 -- just ignore it


xml ignore
I see XML 1.1 is out, and it is so crazy that it is funny.  My considered
recommendation is we all have a good laugh, and then forget about it.

By allowing any character in names, it means that we can have WF XML 1.1
documents which merely opening in a text editor (even an editor for the
document encoding) will corrupt with a well-formedness error: if people use
characters in names which may be split at by automated line-wrapping.  A
markup language which safe practise is to *never* open an entity in a text
editor? Excellent advance!

I would guess that putting in Issue 18 and Issue 21 (should control
characters
be allowed?  should 0x00 be allowed?) are just sacrificial lambs, put in to
be removed later but not serious suggestions. A markup language which was
unsafe to store in files or to transmit on serial lines or as text/*?
Should be a winner!

It would be interesting to speculate what principle causes characters to be
considered whitespace:  certainly it is not that all visible space should be
whitespace (one sensisble rule) or that only ASCII should be space.
Why is not just mapping NEL to #A on input enough to satisfy the IBM
requirement?
This gives us a markup language in which all markup a WF document could look
by inspection as if every character is ASCII but could not be serialized out
to ASCII. because of NELs or LS characters.  Not a common problem, but a
hole.

Another great joke is to "simplify" the naming rules to free a parser from
having to worry about future upgrades to Unicode, but then requiring
Normalized data (and suggesting it should be an error): surely this just
ties the parser to having to know  a particular version of Unicode to know
which normalization rules to use!

Of course, the real way to get independence from Unicode changes is to
define name rules in terms of Unicode properties.  There is a set of Unicode
properties specifically to be used to determine which characters can be used
in identifier. By allowing more characters in names, the XML WG is not
supporting more of Unicode, but less.

ROFL
Rick Jelliffe





PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.