[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

RE: XML-enabled databases, XQuery APIs


xml enabled databases pdf
> I'm neutral here, but for an opposing view see "XML Parsing: 
> A Threat to 
> Database Performance" [1] by Matthias Nicola of IBM. From the 
> abstract:
> 
> "XML parsing is generally known to have poor performance 
> characteristics 
> relative to transactional database processing. Yet, its potentially 
> fatal impact on overall database performance is being 
> underestimated. We 
> report real-word database applications where XML parsing 
> performance is 
> a key obstacle to a successful XML deployment..."
> 
> -- Ron

Thanks for the reference. The paper puzzles me, though. In section 4.1 they
say that:

* parsing a 100K document typically takes 175K instructions

* inserting a row into a relational table requires 30K to 200K instructions.

I would have expected that in a system using shredding, a 100K XML document
would result in say 200 rows being added to the database, giving an insert
cost of 6M to 40M instructions compared with a parsing cost of 175K
instructions, broadly in line with the "two orders of magnitude" that I
guessed in my original post. Clearly I'm thinking of a different kind of
application from the one that they studied.

Michael Kay
http://www.saxonica.com/



> 
> [1] 
> http://lists.w3.org/Archives/Public/www-ws/2004Oct/att-0032/MN
> icola_CIKM_2003_1_.pdf
> 
> Michael Kay wrote:
> > I would be very surprised if XML parsing contributes 
> anything noticeable to
> > the cost of a database load (in shredding mode). Except 
> possibly for a
> > pathological XML document containing 3 nodes and 3 billion bytes.
> > 
> > I haven't looked at the latest products from MS or Oracle, 
> but my experience
> > of database loading with complex data and a realistic level 
> of indexing is
> > that it's a couple of orders of magnitude slower than XML 
> parsing. You can
> > improve that with a custom loader that bypasses SQL and 
> does a lot of
> > heavy-duty sorting and merging to minimize head movement on 
> the disk (does
> > anyone still do that?), but I think it's still true that 
> the parsing cost is
> > immaterial.
> 
> 
> -----------------------------------------------------------------
> The xml-dev list is sponsored by XML.org <http://www.xml.org>, an
> initiative of OASIS <http://www.oasis-open.org>
> 
> The list archives are at http://lists.xml.org/archives/xml-dev/
> 
> To subscribe or unsubscribe from this list use the subscription
> manager: <http://www.oasis-open.org/mlmanage/index.php>
> 
> 



PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.