[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Parsing without resolving entities

  • From: David Carlisle <davidc@n...>
  • To: rmcgarvey@g...
  • Date: Mon, 29 Oct 2007 17:06:21 GMT

Re:  Parsing without resolving entities

It depends a bit on what you are going to do with the document, and in
particular whether you are using an XML API that can support undefined
entity references. if you are (DOM for example) all you need is to
arrange that the DTD that defines the entities is not read.
If you are not (and most XSLT processing for example requires the
entities expanded as the xpath data model does not support undefined
entities) then you need to remove them somehow, perhaps in a form that
lets them be replaced.

As you suggest, preprocessing to hide the ampersand works (especially
for more complicated entities that you could not re-constitute just from
the character data). 

Or you can modify the dtd so that &mdash; expaands to &amp;mdash; and
then you don't need to change the document on input (but do still need
to post process the result to get rid of the extra quoting.

or (perhaps) you can let all the entities expand but then finally
serialise the data using entities rather than characters where possible
(for example XSLT will do this if writing html, or XSLT2 you can specify
a character map (eg
that will do the same thing. Note that this doesn't preserve the
original entities, juist uses entities wherever possible, whether or not
the input used that form.


The Numerical Algorithms Group Ltd is a company registered in England
and Wales with company number 1249803. The registered office is:
Wilkinson House, Jordan Hill Road, Oxford OX2 8DR, United Kingdom.

This e-mail has been scanned for all viruses by Star. The service is
powered by MessageLabs. 

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
First Name
Last Name
Subscribe in XML format
RSS 2.0
Atom 0.3

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.

Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.