[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: SAX and parallel processing

  • To: Uche Ogbuji <Uche.Ogbuji@f...>
  • Subject: Re: SAX and parallel processing
  • From: Alan Gutierrez <alan-xml-dev@e...>
  • Date: Sat, 1 Jan 2005 15:47:04 -0500
  • Cc: Bob Foster <bob@o...>, David Megginson <david.megginson@g...>, xml-dev@l...
  • In-reply-to: <1104611594.3038.153.camel@borgia>
  • Mail-followup-to: Uche Ogbuji <Uche.Ogbuji@f...>,Bob Foster <bob@o...>,David Megginson <david.megginson@g...>, xml-dev@l...
  • References: <200412310131.52268.miles@m...> <1104460276.3038.23.camel@borgia> <41D4CC8C.1080200@o...> <1104465543.3038.28.camel@borgia> <20041231165744.GA20756@m...> <75cb920c04123110151af471f9@m...> <41D5DAD0.7000909@o...> <20041231234449.GA21911@m...> <41D5F0D0.60501@o...> <1104611594.3038.153.camel@borgia>
  • User-agent: Mutt/1.4.1i

python parallel processing
* Uche Ogbuji <Uche.Ogbuji@f...> [2005-01-01 15:33]:
> On Fri, 2004-12-31 at 16:37 -0800, Bob Foster wrote:
> > Right. In order to process a SAX stream in parallel you have to copy the 
> > data in the stream, you can't just "forward" the events. You also have 
> > to instantiate a context for each event, including at least the 
> > namespaces in scope, the Location info. I didn't mean to imply this 
> > would be excessively expensive, just not as lightweight as serially 
> > processed SAX.
> 
> Maybe this is where my perspective, so surprising to so many here, comes
> from.  In Python SAX, all event objects are dissociated from the driver.
> 
> I must say I think it somewhat vindicates the Python approach that it so
> easily extends the framework to advanced implementation strategies.
> 
> I had forgotten this about the original SAX, and I must say it makes for
> a lot more of an "assembler-level" view than SAX as I'm used to using
> it.

    In SAX Strategy, the events are assicated with the driver, more
    or less, until a need for an immutable copy of the data arrises.

    As I noted, before about Characters, also Attributes, a resuable
    object is used, and Characters indexes into the parse buffer,
    but if you need to keep a copy it's like.

    Characters characters = (Characters) event.getCharacters()
                                              .getImmutable();

    Or a more generic.

    Lexeme lexeme = event.getLexeme().getImmutable();

    SAX raw is basically an API for connecting handlers. A few more
    gew-gaws, and the obverer/event pattern is much easier to
    impelment.

--
Alan Gutierrez - alan@e...

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.