[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: use XSLT or XQuery in Saxon?

Subject: Re: use XSLT or XQuery in Saxon?
From: Niclas Hedhman <niclas@xxxxxxxxxxx>
Date: Thu, 6 Jan 2005 18:21:10 +0800
niclas hedhman
On Thursday 06 January 2005 06:53, alan m wrote:

> I have extremely large (over 300 MB) XML file and tens
> of thousands of small xml files generated after
> applying various XSLT on the one big XML file.

I don't know whether Mr Kay have tested Saxon with 100+MB files or not, but we 
did (6.5.?), and could not get a simple transform to complete within hours (I 
think we gave up after ~4hours on a 80-100MB file), on a machine with 1GB of 
RAM.

I wrote a custom transformer in Java doing exactly what we needed using;
 *  SAX events
 *  Only keeping one branch/leaf of the XML tree in memory at any time.
 *  Aggregation of content into small mutable value objects, which were output 
and discarded when completed.

1500 files, varying from 360MB to ~10MB of a total of ~10GB could be processed 
in a linear speed of ~2MB per second, or close to the disk drive speed, on a 
dual CPU workstation.

I suspect that you will end up in 'custom transformer' territory, but perhaps 
Saxon has improved and can deal with the transforms you give it. I suggest 
that you make some simple tests first, which somewhat ressemble what you need 
to do later.


Cheers
Niclas
-- 
---------------
If at first you don't succeed, destroy all evidence that you tried.
 -  Steven Wright

+---------//-------------------+
|   http://www.dpml.net        |
|  http://niclas.hedhman.org   |
+------//----------------------+

Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.