[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Non-infoset


Non-infoset


> -----Original Message-----
> From: Bob Foster [mailto:bob@o...] 
> Sent: Sunday, April 10, 2005 16:45
> To: Amelia A Lewis
> Cc: xml-dev@l...
> Subject: Re:  Non-infoset
> 
> 
> Amelia A Lewis wrote:
> > On 2005-04-10 15:25:51 -0400 Alessandro Triglia 
> <sandro@m...> wrote:
> >> - the nature and amount of whitespace inside tags;
> > 
> > What?
> > 
> > <p>This sentence contains <em>emphasized</em> text.</p>
> > 
> > Has important characteristics in the whitespace.
> 
> Yes, but there isn't anything important about the whitespace 
> within tags:
> 
> <p   >These tags contain <em>superfluous</em> whitespace.</p  >
> 
> I agree with the rest of your comments.
> 
> Even whitespace within tags has its uses, e.g., to break up very long 
> lines where adding whitespace outside tags would change the, 
> uh, infoset.
> 
> I personally think the distinction comes down to 
> hand-authored XML vs. 
> program-generated XML. The latter would mostly be satisifed with 
> exchanging infosets. The former, mostly not.
> 
> Here are the things human authors _need_ that aren't in the Infoset:
> 
> 1) A way to include other documents.
> 2) A way to specify characters that aren't directly supported by the 
> author's editor.
> 3) Comments. Documents are read by other humans, too.


Comments are part of the infoset.


> 4) Validation. (There's no way one could ever produce a 
> DocBook document 
> if there weren't some automatic way to check it.)
> 
> I'm kind of amused by Allesandro's harping on the need for a 
> choice of 
> attribute value delimiters. Ok, you're about to write a value that 
> contains the " character. There are two ways to do it: 
> Delimit the value 
> with ' characters (that choice we don't need) or escape the 
> value with 
> &quot; (those entities we don't need).


I have never said that you can create an XML 1.0 document without using predefined entities or apostrophes  or numeric character references.

My question was:  How relevant is that choice for the creator (does it convey any information in your intentions)?  In other words, would you be happy if that choice was put outside your control (say, if it was made automatically and rigidly by some tool)?  

(Maybe you would not be happy, but how many XML creators would be? -- especially those that are **programs**?)

I'll reword my question about general entities as follows:  Suppose you included a general entity reference somewhere (say, as shorthand for a long phrase defined as an entity).  Does the presence of that general entity reference in the document (as opposed to the expansion of the entity reference) convey any relevant information in your intentions?  In other words, would you be happy if the expansion of a general entity was put outside of your control (say, if it was made automatically and rigidly by some tool)?

What I am trying to determine is to what degree those "choices" of XML 1.0 are relevant to a creator **in that they convey information in the creator's intentions**.  (I am not suggesting in any way that XML 1.0 does not need those choices!)  If they are irrelevant to a sufficient number of creators and in different situations, then the infoset is important.

The existence of a large number of people that need to see entity references unexpanded and need to see CDATA sections as such and need to play with whitespace inside tags etc., does not prove that the infoset is not important -- unless they were the vast majority of creators, which I seriously doubt.

Alessandro


> 
> Not a problem for programs, but there are still a few of us 
> humans out 
> there, scribbling away.
> 
> Bob Foster
> 



PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.