
Re: Fast text output from SAX?


Dennis Sosnoski wrote:

>> ...
>
> The problem, which I've expressed more than once, is to compare the 
> performance for the alternatives of using text XML vs. some post-parse 
> representation of XML documents. For the reasons given in my earlier 
> email I'm choosing to base my timing comparisons on the parse event 
> stream. This is in my opinion the fairest comparison to all 
> approaches, since essentially any application working with XML is 
> going to be using the output from a parse, not the raw document text. 
> SAX2 is the most widely used parser API for Java, so that's what I'm 
> using as the common event stream.

As you may remember from previous discussions, my approach in its ideal 
mode IS working with the 'raw document text'.  I think this aspect of my 
approach differs from what everyone else is proposing, even though we may 
share other strategies.  Except when converting to/from XML 1.0, esXML 
needs no parsing and no serialization for a 'native' application (i.e. one 
that operates in the optimal, esXML-aware mode).  That doesn't mean you 
can't have a standard DOM or SAX interface, only that esDOM is likely to 
be much more efficient.

To benchmark other approaches against esXML, one would have to look at 
the overall process of input/access, create/output, or 
input/process/output because the most direct analog to parsing and 
serialization with esXML simply results in raw reads and writes of large 
(tunable) blocks of data.  I am trying to optimize the overall process 
in a holistic way, a bit outside of the box in some ways.

By analogy, comparing raw SAX and DOM isn't fair because DOM is building 
a data structure.  You would have to compare SAX+data structure to DOM, 
and even better SAX+data structure+test work vs. DOM+test work to get 
meaningful data for a decision between the two.  (I'm imagining a 
business application here.)

We had a long discussion about meaningful test methodology in Santa 
Clara, but I'm not sure that it was really captured as there was little 
agreement.  Many good points were made, IMHO.
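
To make the SAX-versus-DOM point above concrete, here is a rough sketch of what "SAX+data structure+test work vs. DOM+test work" might look like in Java. Everything in it is an assumption made for illustration: the class name SaxVsDomSketch, the generated sample document, the choice of a plain list of element names as the "data structure", and counting elements as the "test work". A real business application would build something richer and do more with it, and a serious benchmark would also need warm-up runs and realistic documents.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.SAXParserFactory;
import org.w3c.dom.Document;
import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

public class SaxVsDomSketch {

    // Hypothetical test input: <orders> containing `items` <item> elements.
    public static byte[] sampleDocument(int items) {
        StringBuilder sb = new StringBuilder("<orders>");
        for (int i = 0; i < items; i++) {
            sb.append("<item id=\"").append(i).append("\">widget</item>");
        }
        return sb.append("</orders>").toString().getBytes(StandardCharsets.UTF_8);
    }

    // "SAX + data structure": record every element name in document order.
    public static List<String> buildWithSax(byte[] xml) throws Exception {
        final List<String> names = new ArrayList<>();
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.newSAXParser().parse(new ByteArrayInputStream(xml), new DefaultHandler() {
            @Override
            public void startElement(String uri, String localName,
                                     String qName, Attributes atts) {
                names.add(qName);
            }
        });
        return names;
    }

    // "DOM": the parser builds the tree itself.
    public static Document buildWithDom(byte[] xml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        return factory.newDocumentBuilder().parse(new ByteArrayInputStream(xml));
    }

    // "Test work" on the SAX-built structure: count elements with a given name.
    public static int countInList(List<String> names, String target) {
        int count = 0;
        for (String name : names) {
            if (name.equals(target)) {
                count++;
            }
        }
        return count;
    }

    // The same "test work" on the DOM tree.
    public static int countInDom(Document doc, String target) {
        return doc.getElementsByTagName(target).getLength();
    }

    public static void main(String[] args) throws Exception {
        byte[] xml = sampleDocument(10_000);
        int rounds = 20;

        long t0 = System.nanoTime();
        int saxCount = 0;
        for (int i = 0; i < rounds; i++) {
            saxCount = countInList(buildWithSax(xml), "item");
        }
        long t1 = System.nanoTime();
        int domCount = 0;
        for (int i = 0; i < rounds; i++) {
            domCount = countInDom(buildWithDom(xml), "item");
        }
        long t2 = System.nanoTime();

        System.out.printf("SAX + list + count: %d items, %d ms%n",
                saxCount, (t1 - t0) / 1_000_000);
        System.out.printf("DOM + count:        %d items, %d ms%n",
                domCount, (t2 - t1) / 1_000_000);
    }
}
```

The point of the sketch is only that both sides pay for building a structure and for using it before the timer stops. Timing the bare SAX event stream against a full DOM build would leave that cost out of one side.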

> ...
>  - Dennis

sdw

-- 
swilliams@h... http://www.hpti.com Personal: sdw@l... http://sdw.st
Stephen D. Williams 703-724-0118W 703-995-0407Fax 20147-4622 AIM: sdw


