Re: Use cases for parsing efficiency (was Re: Parsing

To: Mike Champion <mc@x...>
Subject: Re: Use cases for parsing efficiency (was Re: Parsing efficiency? - why not 'compile'????)
From: =?ISO-8859-15?Q?Bill_de_h=D3ra?= <bill.dehora@p...>
Date: Wed, 26 Feb 2003 16:06:15 +0000
Cc: xml-dev@l...
Organization: Propylon
References: <OF047F14BF.0A3C2919-ONCA256CD8.00031C17@f...> <15962.49099.318137.237747@m...> <3E5BDD04.70105@y...> <15963.57936.208121.931029@m...> <oprk59i5y1ezizxn@s...> <15963.65069.422647.95309@m...> <3E5C9518.1050402@e...> <15964.46767.897161.524921@m...> <oprk7i1rxzezizxn@s...>
Reply-to: bill.dehora@p...
User-agent: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.1) Gecko/20020826

Play the video

Mike Champion wrote:

> My day job colleagues changed my mind by pointing out that in 
> industrial- strength, native XML processing environments, nothing much 
> is happening besides XML being parsed, processed (stored, queried, 
> transformed) and serialized again.  

That's quite a lot happening (other than parsing). I mean what else 
/could/ happen?

> The better code gets and the more 
> efficient customers get in using the code (e.g. building DB indexes and 
> optimizing queries, in our case),the more and more that 
> parse/serialization step becomes a bottleneck.  I've heard the same 
> thing from industrial-strength SOAP developers -- as the volume of 
> messages goes up and processing resources get dedicated to XML (i.e., no 
> application logic or DB access happening on the machine parsing, 
> processing, serializing the XML), then the bottlenecks in XML parsing 
> become increasingly apparent.  Sure, Father Moore will ultimately solve 
> this problem with faster hardware, but that's not a great marketing 
> pitch for software people.

So, following David, in one hundred secs, you spend one second 
parsing XML and 99 seconds doing somehting else. Suppose you get a 
tenfold speedup doing something else (cigars all round). You're down 
to 11 seconds. Parsing is approaching 10%. A tenfold speedup in 
parsing only saves you 1/2 a second, or approaching 9%, now. And 
because it's /still/ the wrong side of the 80/20 split, it's /still/ 
  not place to be looking, unless you know that processing time is 
evenly distributed through the code base (but that would be rare, 
and probably worth writing a paper on). The same reasoning applies 
at 10% time to begin with.

> So, I'm not at all sure that standardization of efficient infoset 
> serializations is something that the W3C or anyone else should undertake 
> at this time. But I don't want to see the W3C preclude it (or XML geeks 
> to conclude that it is evil) either.  XML processing is moving more and 
> more into the core of real enterprises. We'll see the previous situation 
> where XML is just a transient serialization format between DBs and 
> applications turned around, so that most of the components of a 
> processing pipeline are taking XML in, storing/processing it natively,  
> and putting XML out.    In that scenario, lots of people are going to be 
> looking for ways to reduce the parsing bottlenecks ... 

Performance arguments mean nothing without measurement. And even if 
parsing is a problem, it does not follow that XML requries 
subsetting. For example, you might be better off with an 'enterpise 
class parser', or an 'enterpise class datamodel' than with an 
'industrial class subset'.

I'd like to see some some numbers on parsing concerns, so we could 
figure out a) is there a problem, b) where is a problem, c) what's 
the solution.

Bill de hÓra

Follow-Ups:
- Re: Use cases for parsing efficiency (was Re: Parsing efficiency? - why not 'compile'????)
  - From: "Alaric B. Snell" <alaric@a...>

References:
- Parsing efficiency? - why not 'compile'????
  - From: Matthew.Bennett@f...
- re: Parsing efficiency? - why not 'compile'????
  - From: David Megginson <david@m...>
- Re: Parsing efficiency? - why not 'compile'????
  - From: "J.Pietschmann" <j3322ptm@y...>
- Re: Parsing efficiency? - why not 'compile'????
  - From: David Megginson <david@m...>
- Re: Parsing efficiency? - why not 'compile'????
  - From: Mike Champion <mc@x...>
- Re: Parsing efficiency? - why not 'compile'????
  - From: David Megginson <david@m...>
- Re: Parsing efficiency? - why not 'compile'????
  - From: Robin Berjon <robin.berjon@e...>
- Re: Parsing efficiency? - why not 'compile'????
  - From: David Megginson <david@m...>
- Use cases for parsing efficiency (was Re: Parsingefficiency? - why not 'compile'????)
  - From: Mike Champion <mc@x...>

Prev by Date: RE: The subsetting has begun
Next by Date: RE: The subsetting has begun
Previous by thread: Use cases for parsing efficiency (was Re: Parsingefficiency? - why not 'compile'????)
Next by thread: Re: Use cases for parsing efficiency (was Re: Parsing efficiency? - why not 'compile'????)
Index(es):
- Date
- Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Subscribe in XML format

RSS 2.0
Atom 0.3

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.

Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

XML Editor - Download a 15 Day Free Trial Now >

See What's New in Stylus Studio >

Buy Stylus Studio - XML Editor - Now >