Re: The subsetting has begun


Elliotte Rusty Harold scripsit:

> If you think any one data model is going to suffice, 
> you're kidding yourself. All they have in common is XML syntax (and 
> not always that since the infoset and DOM can both create 
> non-well-formed documents)

Distinguo.  The Infoset can't "create" anything; rather, infosets are created
from documents.  There is no notion of creating or modifying anything in the
infoset.  It is not a data model in the sense of DOM/JDOM/XOM, despite the
superficial similarity:  it is a minimally abstract representation of
UnicodeWithAngleBrackets syntax, where only silly distinctions are thrown away
(along with most DTD information, omitted from the list below):

   4. White space outside the document element.
   5. White space immediately following the target name of a PI.
   6. Whether characters are represented by character references.
   7. The difference between the two forms of an empty element: <foo/>
      and <foo></foo>.
   8. White space within start-tags (other than significant white space
      in attribute values) and end-tags.
   9. The difference between CR, CR-LF, and LF line termination.
  10. The order of attributes within a start-tag.
  17. The kind of quotation marks (single or double) used to quote
      attribute values.
  18. The boundaries of general parsed entities.
  19. The boundaries of CDATA marked sections.

If your XML application, Walter, depends on any of these facts (as a decent XML
editor does), then an Infoset representation of the XML document will not serve
you.  But otherwise, the syntax and the Infoset are indeed twins.
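
To make the list above concrete, here is a minimal sketch in Python (3.8 or
later, for xml.etree.ElementTree.canonicalize).  It is only an illustration:
Canonical XML is used here as a stand-in for "what is left after the silly
distinctions are thrown away", which is an assumption of this example and not
a claim that C14N and the Infoset are the same thing.  The two sample
documents and the names doc_a, doc_b and doc_c are hypothetical.

```python
import xml.etree.ElementTree as ET

# Two hypothetical serializations that differ only in ways the list above
# treats as insignificant.
doc_a = (
    "<doc b='2' a=\"1\">"          # quote style and attribute order
    "<empty/>"                      # empty-element tag form
    "<t>&#65;<![CDATA[<x>]]></t>"   # character reference and CDATA section
    "<p>one\r\ntwo</p>"             # CR-LF line termination
    "</doc>\n\n"                    # white space after the document element
)
doc_b = (
    '<doc   a="1"  b="2" >'         # extra white space inside the start-tag
    "<empty></empty >"              # start-tag/end-tag pair, space in end-tag
    "<t>A&lt;x&gt;</t>"             # literal character and escaped markup
    "<p>one\ntwo</p>"               # LF line termination
    "</doc>"
)

# A document that differs in real content (an attribute value).
doc_c = doc_b.replace('a="1"', 'a="9"')

canon_a = ET.canonicalize(doc_a)
canon_b = ET.canonicalize(doc_b)
canon_c = ET.canonicalize(doc_c)

# The syntactic differences vanish; the content difference does not.
print(canon_a == canon_b)
print(canon_a == canon_c)
```

An application that needs to tell doc_a from doc_b (an editor that must
round-trip the author's quoting and CDATA sections, say) has to work from
the syntax and not from a representation like this one.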

-- 
First known example of political correctness:   John Cowan
"After Nurhachi had united all the other        http://www.reutershealth.com
Jurchen tribes under the leadership of the      http://www.ccil.org/~cowan
Manchus, his successor Abahai (1592-1643)       jcowan@r...
issued an order that the name Jurchen should       --S. Robert Ramsey,
be banned, and from then on, they were all         _The Languages of China_
to be called Manchus."
