
Re: White Space

  • From: David Brownell <david-b@p...>
  • To: David Megginson <david@m...>
  • Date: Mon, 16 Aug 1999 10:07:33 -0700

David Megginson wrote:
> 
> arkin writes:
> 
>  > A generic SAX parser has two methods of reporting character data, one
>  > clearly indicates that such character data is whitespace. What type of
>  > whitespace should be reported as whitespace? Can the application simply
>  > ignore whatever character data is reported as whitespace?
> 
> The only whitespace reported that way is whitespace in element-only
> content: that means that there has to be a DTD, and the DTD has to say
> that an element can contain only other elements.  This is a reporting
> requirement for validating parsers from the XML 1.0 recommendation.
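
To make the two reporting paths concrete, here is a minimal Java sketch. It assumes the JDK's bundled JAXP parser with validation switched on and a small inline DTD; the class name `WhitespaceReport`, the `Tally` handler and the sample document are illustrative and not taken from the thread.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

final class WhitespaceReport {

    // Hypothetical sample: <list> has element-only content, <item> holds text.
    static final String SAMPLE =
        "<!DOCTYPE list [\n"
      + "<!ELEMENT list (item)*>\n"
      + "<!ELEMENT item (#PCDATA)>\n"
      + "]>\n"
      + "<list>\n  <item> a b </item>\n</list>";

    // Collects what arrives through each of the two SAX callbacks.
    static final class Tally extends DefaultHandler {
        final StringBuilder text = new StringBuilder();
        final StringBuilder ignorable = new StringBuilder();

        @Override
        public void characters(char[] ch, int start, int length) {
            text.append(ch, start, length);
        }

        @Override
        public void ignorableWhitespace(char[] ch, int start, int length) {
            ignorable.append(ch, start, length);
        }
    }

    static Tally parse(String xml) throws Exception {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setValidating(true); // the parser must read the DTD to know the content models
        SAXParser parser = factory.newSAXParser();
        Tally tally = new Tally();
        parser.parse(new InputSource(new StringReader(xml)), tally);
        return tally;
    }

    public static void main(String[] args) throws Exception {
        Tally tally = parse(SAMPLE);
        System.out.println("characters():          [" + tally.text + "]");
        System.out.println("ignorableWhitespace(): " + tally.ignorable.length() + " chars");
    }
}
```

With a parser that follows the rule described above, the line breaks and indentation directly inside `<list>` should arrive through `ignorableWhitespace()`, while the spaces inside `<item>` stay in `characters()`.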

Hmm, the XML spec never quite seemed clear about that to me.  It didn't
quite include a definition of the term "ignorable whitespace".

What about an empty element "<EMPTY>  <!-- spaces!! --> </EMPTY>" ...
isn't that "ignorable" whitespace as well?  It "must be" passed to the
app, and clearly isn't regular character text.

FWIW I concluded "ignorable" whitespace is within elements that have a
content model that's not "ANY" or a mixed content model.  That is, it's
wherever normal characters can't appear.
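
The reading proposed here can be written down as a small predicate. This is only a sketch of that reading, not a rule taken from the XML spec: it assumes the content model arrives as a normalized string ("EMPTY", "ANY", or a parenthesized group, the form later used by SAX2's `DeclHandler.elementDecl`), and the class and method names are hypothetical.

```java
final class ContentModels {

    /**
     * Returns true when, under the reading above, whitespace inside an
     * element with this content model would count as "ignorable":
     * any model other than ANY or mixed content.
     */
    static boolean whitespaceIgnorable(String model) {
        String m = model.trim();
        boolean any = m.equals("ANY");
        boolean mixed = m.startsWith("(#PCDATA"); // covers (#PCDATA) and (#PCDATA|a|b)*
        return !any && !mixed;
    }

    public static void main(String[] args) {
        String[] models = { "EMPTY", "ANY", "(#PCDATA)", "(#PCDATA|b|i)*", "(head,body)" };
        for (String model : models) {
            System.out.println(model + " -> " + whitespaceIgnorable(model));
        }
    }
}
```

Note that this classifies `EMPTY` as ignorable, which is exactly the point where this reading goes beyond "element-only content".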


>  > The XML specification clearly indicates some guidelines for handling
>  > white space in a consistent manner that saves the application developer
>  > from dealing with it, and will solve all of our problems (maybe except
>  > world hunger). Would it be reasonable to define two SAX parser layers,
>  > one before and one after the white space stripping?
> 
> You can use the same API for both, but any whitespace stripping must
> be strictly at the application's discretion.

Where "application" is a fuzzy notion:  everything above the XML processor,
which could primarily consist of library code that doesn't want to give
such options to its callers.
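
One way such a layer might look, keeping the same API on both sides, is a SAX2 filter that sits between the parser and whatever is above it. This is a hedged sketch: `XMLFilterImpl` is the standard SAX2 helper (it postdates this message), while the class name and the `strip` flag are assumptions made for illustration. Whoever constructs the filter, application or library, makes the stripping decision.

```java
import org.xml.sax.SAXException;
import org.xml.sax.helpers.XMLFilterImpl;

// Forwards every SAX event unchanged, except that ignorable whitespace
// is dropped when the caller asked for stripping.
final class WhitespaceStripFilter extends XMLFilterImpl {

    private final boolean strip;

    WhitespaceStripFilter(boolean strip) {
        this.strip = strip;
    }

    @Override
    public void ignorableWhitespace(char[] ch, int start, int length) throws SAXException {
        if (!strip) {
            // Pass the event through to the downstream ContentHandler, if any.
            super.ignorableWhitespace(ch, start, length);
        }
    }
}
```

A caller would typically set the real parser as the filter's parent with `setParent(...)`, register its own handler with `setContentHandler(...)`, and then call `parse(...)` on the filter instead of on the parser.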

- Dave

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i...
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@i... the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@i... the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@i...)


