[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: regular expression on xml

  • From: Liam R E Quin <liam@w3.org>
  • To: ycao5@scs.carleton.ca
  • Date: Tue, 26 Jan 2010 18:34:59 -0500

Re:  regular expression on xml
On Tue, 2010-01-26 at 15:47 -0500, ycao5@scs.carleton.ca wrote:

>       In my xml application, I want to write a small parser which can 
> include useful parts in an xml document but ignore the rest. The  
> overhead of sax/dom parser is large. So is it reasonable to use  
> regular expressions to parse xml?

A sax parser should not give you a large overhead, as it doesn't
build a tree (you do that yourself) and doesn't use much memory.

You can use regular expressions (and XML was defined with this in
mind) as long as you know there are no CDATA sections... otherwise
it tends to get too hairy too quickly to be useful.  (I am assuming
pcre/java style extended regular expressions of course).  Commented-out
markup can cause problems too.

But it is likely to be faster to use a proper native C XML parser.

A parser that builds a DOM is likely overkill if you are only using
a small fraction of the document.

Liam

-- 
Liam Quin - XML Activity Lead, W3C, http://www.w3.org/People/Quin/
Pictures from old books: http://fromoldbooks.org/
Ankh: irc.sorcery.net irc.gnome.org www.advogato.org



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.