[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

RE: progressive parsing of XML. Any technology out there?

  • To: 'Guillaume Lebleu' <gl@b...>
  • Subject: RE: progressive parsing of XML. Any technology out there?
  • From: "Thomas, Mark - BLS CTR" <Thomas.Mark@b...>
  • Date: Wed, 16 Jun 2004 16:12:34 -0400
  • Cc: 'XML Developers List' <xml-dev@l...>

progressive parsing
If I understand you correctly, you want to stream-parse a potentially huge
XML document so as not to put it all in memory, but you want to identify
certain branches to be tree-parsed and/or validated, etc?

If you're not averse to using Perl, the XML::Twig module is designed to do
exactly that, and easily.

The site (http://www.xmltwig.com/) seems to be down right now but there's a
tutorial here: http://www.xml.com/pub/a/2001/03/21/xmltwig.html and module
documentation here: http://search.cpan.org/~mirod/XML-Twig-3.15/Twig.pm

- Mark.

> -----Original Message-----
> From: Guillaume Lebleu [mailto:gl@b...] 
> Sent: Wednesday, June 16, 2004 3:02 PM
> To: xml-dev@l...
> Subject:  progressive parsing of XML. Any technology 
> out there?
> 
> 
> Hello,
> 
> Assuming I receive a very large XML document coming in 
> through HTTP, for instance, a big SOAP message that embeds 
> other XML documents and unstructured data in base64 encoded 
> values, and let's say I want to validate information in my 
> SOAP Header and return a response as fast as possible, but I 
> want to process (esp. validate) the rest of my XML document 
> asynchronously.
> 
> What I would like to do is control the branches of my tree I 
> want to parse and to what depth (hence the term "progressive 
> parsing"), and end up with some objects that point to 
> unparsed xml for the branches and depth I don't need for my 
> preliminary processing.
> 
> Of course, I want something that is pretty dynamic and does 
> not require low level SAX coding on the application developer.
> 
> (Right now, the only way out there is to use things like 
> SOAP+Attachments, where you actually are not using XML so 
> that there is an explicit separation between multiple XML 
> documents that can be parsed separately, but this approach 
> then makes a design when you have XML documents containing 
> pointers to other documents in the MIME message received that 
> needs to be resolved, etc.).
> 
> Are there technologies out there to do this in a better way?
> 
> Thanks
> 
> Guillaume
> 
> 
> -----------------------------------------------------------------
> The xml-dev list is sponsored by XML.org 
> <http://www.xml.org>, an initiative of OASIS 
<http://www.oasis-open.org>

The list archives are at http://lists.xml.org/archives/xml-dev/

To subscribe or unsubscribe from this list use the subscription
manager: <http://www.oasis-open.org/mlmanage/index.php>


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.