
Re: Exploiting multi-core CPUs during XML parsing


> Sounds a nice idea independent of any speed benefits. I toyed with the idea
> of parsing XSLT match patterns backwards at one stage, but didn't pursue
> it. My brain is wired for left-to-right reading...
> 
> As a matter of interest, is there any problem with decoding UTF-8 when
> reading backwards?
> 
> Shame that the spec doesn't require ">" to be escaped, I imagine this causes
> a fair problem with backtracking, for example how do you cope with
> 
> ...... <a><b/><c/></a>  -->
> 
> which might or might not be part of a comment?
> 
> IIRC we were able to read files backwards on ICL VME. That's history, but
> many of its features have reappeared in Windows 25 years later ...
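
On the UTF-8 question quoted above: continuation bytes always have the bit
pattern 10xxxxxx and lead bytes never do, so a backward reader can step over
continuation bytes until it reaches a lead byte. A minimal sketch of that
idea follows. The function name decode_utf8_backwards is made up for
illustration, and the sketch leaves validation to Python's own decoder
rather than handling malformed input itself.

```python
def decode_utf8_backwards(data: bytes):
    """Yield the characters of UTF-8 encoded `data` from last to first.

    Hypothetical helper for illustration only. Malformed input surfaces
    as UnicodeDecodeError from bytes.decode().
    """
    end = len(data)
    while end > 0:
        start = end - 1
        # Continuation bytes look like 10xxxxxx. Step back over them
        # (a sequence is at most 4 bytes long) to reach the lead byte.
        while start > 0 and (data[start] & 0xC0) == 0x80 and end - start < 4:
            start -= 1
        yield data[start:end].decode("utf-8")
        end = start


if __name__ == "__main__":
    sample = "a\u20ac\u00fc".encode("utf-8")  # 1-, 3- and 2-byte sequences
    print(list(decode_utf8_backwards(sample)))
```

This only covers character decoding. It does nothing for the harder
problem of deciding whether a given ">" closes a tag or sits inside a
comment.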

I wrote a reverse parser in Pascal once as part of an editor project. It 
was designed to determine the current context the user was working in. 
In general, starting from a fixed position somewhere in the middle of a 
document, the amount of branching and caching you had to do was excessive, 
and in virtually all cases it led to scans back to the start of the 
document. In Sean's case I imagine the backward parser would still have to 
make a lot of assumptions until it reached the crossover position between 
the two threads, at which point it might issue a fatal error (if the 
assumptions turned out to be wrong).

I wonder if you could instead work on a sequential multi-threaded 
approach: one thread to handle decoding and chunking of characters, 
another to handle parsing of lexical structures, and a third (possibly at 
a driver level) to handle external WF checks (such as character class 
checks, name checks, and duplicate attribute checks). Decoupling these 
pieces would allow you to very easily turn off WF checks if you knew that 
the document was WF via an out-of-band mechanism.
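
To make the shape of that pipeline concrete, here is a rough sketch with
three threads joined by queues. Everything in it is an assumption made for
illustration: the stage names, the toy regular expressions (which ignore
comments, CDATA, and ">" inside attribute values), and the simplified name
rule are nowhere near a real XML parser. In CPython the global interpreter
lock also means this shows the decoupling, not a speedup.

```python
import codecs
import queue
import re
import threading

_DONE = object()  # end-of-stream marker passed between stages

# Toy patterns, assumed for illustration only.
TOKEN = re.compile(r"<[^<>]*>|[^<]+")       # a complete tag or a text run
TAG = re.compile(r"</?([^\s/>]+)")          # captures the tag name
NAME = re.compile(r"[A-Za-z_:][\w:.\-]*")   # simplified XML Name rule
ATTR = re.compile(r"\s([^\s=<>/]+)\s*=\s*(?:\"[^\"]*\"|'[^']*')")


def decode_stage(chunks, out):
    """Stage 1: decode byte chunks into text incrementally."""
    decoder = codecs.getincrementaldecoder("utf-8")()
    try:
        for chunk in chunks:
            out.put(decoder.decode(chunk))
        out.put(decoder.decode(b"", final=True))
    finally:
        out.put(_DONE)


def lex_stage(inp, out):
    """Stage 2: split text into tag and text tokens."""
    buf = ""
    try:
        while (text := inp.get()) is not _DONE:
            buf += text
            pos = 0
            while m := TOKEN.match(buf, pos):
                out.put(m.group())
                pos = m.end()
            buf = buf[pos:]  # keep an incomplete tag for the next chunk
        if buf:
            out.put(buf)     # unterminated tag, passed along as-is
    finally:
        out.put(_DONE)


def check_stage(inp, results, errors, check_wf):
    """Stage 3: collect tokens and, optionally, run the WF-style checks."""
    while (tok := inp.get()) is not _DONE:
        results.append(tok)
        if not check_wf or not tok.startswith("<"):
            continue
        m = TAG.match(tok)
        if not tok.endswith(">") or m is None or not NAME.fullmatch(m.group(1)):
            errors.append(f"bad tag or name: {tok}")
        names = [a.group(1) for a in ATTR.finditer(tok)]
        if len(names) != len(set(names)):
            errors.append(f"duplicate attribute: {tok}")


def parse(chunks, check_wf=True):
    """Run the three stages on separate threads; return (tokens, errors)."""
    decoded, tokens = queue.Queue(), queue.Queue()
    results, errors = [], []
    threads = [
        threading.Thread(target=decode_stage, args=(chunks, decoded)),
        threading.Thread(target=lex_stage, args=(decoded, tokens)),
        threading.Thread(target=check_stage,
                         args=(tokens, results, errors, check_wf)),
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results, errors


if __name__ == "__main__":
    doc = [b"<a x='1' ", b"x='2'><1b/>hi</a>"]
    print(parse(doc))
    print(parse(doc, check_wf=False))
```

Calling parse(doc, check_wf=False) skips the checks in the third stage
without touching the other two, which is the point of keeping them
separate.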


Cheers,
Jeff Rafter
