[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

RE: use XSLT or XQuery in Saxon?

Subject: RE: use XSLT or XQuery in Saxon?
From: "Michael Kay" <mike@xxxxxxxxxxxx>
Date: Thu, 6 Jan 2005 12:10:46 -0000
custom transformer
> > I have extremely large (over 300 MB) XML file and tens
> > of thousands of small xml files generated after
> > applying various XSLT on the one big XML file.
> 
> I don't know whether Mr Kay have tested Saxon with 100+MB 
> files or not, but we 
> did (6.5.?), and could not get a simple transform to complete 
> within hours (I 
> think we gave up after ~4hours on a 80-100MB file), on a 
> machine with 1GB of RAM.

I've only gone up to about 50Mb myself, but I know of users who've gone up
to 200Mb.

For one Saxonica client I managed to get the processing time for a 40Mb
transformation down from 90 minutes to 45 seconds. Once you've allocated
enough memory, if it still takes hours then it's because there's a
non-linearity in the stylesheet logic, and this can usually be eliminated by
careful use of keys, sorting, or grouping.

But I do agree with you that there are some problems that are better tackled
with a SAX-based Java application: or sometimes a SAX filter as a precursor
to an XSLT transformation.

Michael Kay
http://www.saxonica.com/
 
> 
> I wrote a custom transformer in Java doing exactly what we 
> needed using;
>  *  SAX events
>  *  Only keeping one branch/leaf of the XML tree in memory at 
> any time.
>  *  Aggregation of content into small mutable value objects, 
> which were output 
> and discarded when completed.
> 
> 1500 files, varying from 360MB to ~10MB of a total of ~10GB 
> could be processed 
> in a linear speed of ~2MB per second, or close to the disk 
> drive speed, on a 
> dual CPU workstation.
> 
> I suspect that you will end up in 'custom transformer' 
> territory, but perhaps 
> Saxon has improved and can deal with the transforms you give 
> it. I suggest 
> that you make some simple tests first, which somewhat 
> ressemble what you need 
> to do later.
> 
> 
> Cheers
> Niclas
> -- 
> ---------------
> If at first you don't succeed, destroy all evidence that you tried.
>  -  Steven Wright
> 
> +---------//-------------------+
> |   http://www.dpml.net        |
> |  http://niclas.hedhman.org   |
> +------//----------------------+

Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.