[XQuery Talk Mailing List Archive Home] [By Date] [By Thread] [By Subject] [By Author] [Recent Entries] [Reply To This Message]

Benchmarking on XQuery Implementation in DBMS

Michael Kay mike at saxonica.com
Thu Oct 29 09:09:48 PST 2009


  Benchmarking on XQuery Implementation in DBMS
A fair bit of work has been done but more is always needed.
 
I think you need to start out with very clear objectives, especially in the
following areas:
 
* are you going to try and measure how well the products handle something
close to a "real-life" workload (including factors such as updating,
database loading, etc), or are you going to measure a small number of
synthetic queries?
 
* are you going to make strenuous efforts to get the best possible
performance out of each product that you test, or are you simply going to
measure what it achieves "out of the box"?
 
* do you just want to produce some numbers, or do you want to use those
numbers to test some theories or hypotheses?
 
Most of the papers I've seen on XSLT, XPath, or XQuery benchmarks over the
years suffer from one or more of the following drawbacks:
 
* Some of the queries they measure are totally unrepresentative of real
life, for example IIRC one benchmark has a query like
/descendant::*/ancestor::*/descendant::*. Who cares whether that performs
well? (But it can be useful, of course, if your objective is to find out how
path expressions are implemented internally).
 
* Simple flaws in the measurement methodology, for example failing to take
account of Java VM startup time, or to distinguish query compilation,
document parsing, and query evaluation costs. Such flaws can make your
numbers meaningless, but they are surprisingly common in published papers.
 
* Running products in a sub-optimal way. This might say something about the
product's usability or documentation, but if you run with the wrong
configuration options, it tells you very little about the product's true
performance capabilities. This is especially true where your knowledge of
one of the products is much better than your knowledge of others: a factor
that taints all comparative benchmarks produced by vendors.
 
* Making incorrect inferences from the measurements (for example, the
authors of one paper drew inferences about the internal workings of Saxon
which were completely and bizarrely wrong, and they didn't check them with
me before publication.)
 
It's worth remembering that in real life, the performance you achieve
depends much more on your skills as a developer and your knowledge and
experience of the technology you are using than on the absolute capabilities
of the chosen product. Performance problems in real life don't occur because
the technology is intrinsically slow, they occur because it is misused. This
places a severe limitation on the usefulness of benchmark numbers.
 
Regards,

Michael Kay
http://www.saxonica.com/
http://twitter.com/michaelhkay 


  _____  

From: http://x-query.com/mailman/listinfo/talk [mailto:http://x-query.com/mailman/listinfo/talk] On Behalf
Of Wely Live
Sent: 29 October 2009 02:57
To: http://x-query.com/mailman/listinfo/talk
Subject:  Benchmarking on XQuery Implementation in DBMS


Hi,
 
I am planning to research the comparison and benchmarking XQuery
Implementation in DBMS (SQL Server, Oracle, DB2). There are a few question
to discuss as following:
1. I wonder if anybody has ever done the similar research?
    I would be appreciate if anybody could provide my any related source or
document.
2. To do the benchmarking, we of course need a sample dataset. Anybody has
reference where can I find a good dataset sample?
 
Thanks!
 
Regards,
Wely
Post Graduate Student
Nanyang Technological University

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://x-query.com/pipermail/talk/attachments/20091029/babab73b/attachment.htm


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2007 All Rights Reserved.