[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: XML Blueberry

  • From: Rick Jelliffe <ricko@a...>
  • To: xml-dev@l...
  • Date: Fri, 22 Jun 2001 16:02:07 +0800

xml handles spaces
From: "Elliotte Rusty Harold" <elharo@m...>

> And XML handles these perfectly. Indeed when you're writing or
> reading XML you simply don't care which line ending convention was
> used, which is the way it should be.

I'm with Rusty.

Lets not get confused!  (That should be "Note to self: try not to get
confused!") There are two separate cases here.

The first case is where XML is generated by a program, running on an IBM
system with this convention.  In that case, there is no need to extend the
characters which the XML parser recognises as whitespace, because the
characters sent are under programmer control.   And the parser does not
(should not) care about whether the IBM line-end character is sent as part
of data.
This only requires that the IBM line-end character should be allowed as part
of the document character set.  I think this should be uncontraversial, and
only requires a 3rd edition of XML, as a correction.

The second case is where we want to edit XML on an IBM system which, out of
the control of the user, inserts IBM line-end characters when the user is
typing in their markup.   To me this second case is no different to the case
of East Asians typing with editors that stick in ideographic spaces rather
than ASCII spaces: tough luck, you need to run the data through a converted.

So neither of these cases justify adding IBM new-lines to the whitespace
characters recognised by XML tokenizers.

So perhaps the following is a reasonable compromise:

 1) upgrade the document character set to Unicode 3.1 as a 3rd edition
 2) state that "XML processors may, at user option, if they detect the
    IBM newline or  any other visual white-space in markup, element content
or
    in an entity/XML declaration, replace the characters with LF, as a
matter of
    entity management."

This keeps the status of those characters w.r.t. XML 1.0 clear, in
particular
the fact that they will cause interoperability problems when used with
other XML documents, but it provides a workaround for inhouse use.

Cheers
Rick Jelliffe



PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.