
Re: parsing large xml files using Saxon 6.5.2

Subject: Re: parsing large xml files using Saxon 6.5.2
From: Mukul Gandhi <mukulw3@xxxxxxxxx>
Date: Mon, 11 Aug 2003 07:30:41 -0700 (PDT)
Hi Marina,
 If you overload the XSLT processor, *it is bound
to take time*. The process may even hang (as you are
reporting). An XML file of 18.1Mb will certainly
take a lot of resources. If possible, split the
original XML file into smaller ones and then apply
the XSL to each smaller file. That may run faster.
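
As a rough illustration of the splitting idea, here is a minimal Python sketch. It assumes the big file is a flat sequence of record elements under one root element, which is only a guess about 1cl.xml. The names split_xml, records_per_file and the partNNNN.xml file names are made up for this example. It streams the input with iterparse, so the whole document is never held in memory at once.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def _write_chunk(chunk, out_dir, index):
    # Hypothetical naming scheme: part0001.xml, part0002.xml, ...
    path = out_dir / f"part{index:04d}.xml"
    ET.ElementTree(chunk).write(str(path), encoding="utf-8", xml_declaration=True)
    return path


def split_xml(source, out_dir, records_per_file=1000):
    """Split an XML file into smaller files.

    Each output file repeats the root element (tag and attributes) and
    holds up to records_per_file direct children of the original root.
    Assumption: the input is a flat list of records under a single root.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    parts, chunk, depth, root = [], None, 0, None

    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start":
            depth += 1
            if depth == 1:
                root = elem  # remember the root to copy its tag and attributes
            continue
        depth -= 1
        if depth != 1:
            continue  # not a direct child of the root
        if chunk is None:
            chunk = ET.Element(root.tag, dict(root.attrib))
        chunk.append(elem)
        root.remove(elem)  # drop it from the source tree to keep memory low
        if len(chunk) == records_per_file:
            parts.append(_write_chunk(chunk, out_dir, len(parts) + 1))
            chunk = None

    if chunk is not None:  # write the final, possibly shorter, chunk
        parts.append(_write_chunk(chunk, out_dir, len(parts) + 1))
    return parts
```

You could then call something like split_xml("1cl.xml", "parts", 500) and run the stylesheet on each file in the parts directory. This only gives the same result as one big run if the stylesheet handles each record independently. If it groups or counts across the whole document, the partial outputs would still have to be merged.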

If you can give more details about your XML and the
XSL you are trying, somebody can probably give you
ideas to improve the performance of the transformation.

Regards,
Mukul


--- marina <marina777uk@xxxxxxxxx> wrote:
> Hi,
> 
> I am having problems parsing some xml files. I have a
> 1 GHz processor and 256 MB of RAM.
> 
> The xslt stylesheet "wordgroup.xsl" from Dimitri
> (thank you!) was tested
> and worked perfectly on smaller test files. When I
> run it on a larger file
> "1cl.xml" = 18.1Mb it builds the tree for
> str-Split-to-words.xsl and then sits there for
> hours.
> 
> See output below.
> 
> ---------------------------------------------------------------------------------
> Microsoft Windows 2000 [Version 5.00.2195]
> (C) Copyright 1985-2000 Microsoft Corp.
> 
> h:\saxon\testbed>saxon -t -o output.txt 1cl.xml wordgroup.xsl
> SAXON 6.5.2 from Michael Kay
> Java version 1.1.4
> Preparation time: 371 milliseconds
> Processing file:/h:/saxon/testbed/1cl.xml
> Building tree for file:/h:/saxon/testbed/1cl.xml using class com.icl.saxon.tinytree.TinyBuilder
> Tree built in 7070 milliseconds
> Building tree for file:/h:/saxon/testbed/strSplit-to-Words.xsl using class com.icl.saxon.tinytree.TinyBuilder
> Tree built in 10 milliseconds
> 
> ---------------------------------------------------------------------------------
> 
> 
> So I made another xml file "little.xml" by pasting
> sections of 1cl.xml in different sizes to see
> where it was having problems processing.
> 
> little.xml = 1.4Mb time = 1.2 sec
> little.xml = 4.4Mb time = 3.3 sec
> little.xml = 7.3Mb time = 6 sec
> little.xml = 10.3Mb time = 9.8 sec
> little.xml = 19.2Mb (bigger than the file I want to
> parse!) time = 26.1 sec! (see nice output below)
> 
> 
> h:\saxon\testbed>saxon -t -o output.txt little.xml wordgroup.xsl
> SAXON 6.5.2 from Michael Kay
> Java version 1.1.4
> Preparation time: 701 milliseconds
> Processing file:/h:/saxon/testbed/little.xml
> Building tree for file:/h:/saxon/testbed/little.xml using class com.icl.saxon.tinytree.TinyBuilder
> Tree built in 7912 milliseconds
> Building tree for file:/h:/saxon/testbed/strSplit-to-Words.xsl using class com.icl.saxon.tinytree.TinyBuilder
> Tree built in 20 milliseconds
> Execution time: 26178 milliseconds
> 
> Any ideas for me to try?
> 
> Thanks
> 
> Marina
> 
> 
> 
>  XSL-List info and archive: 
> http://www.mulberrytech.com/xsl/xsl-list
> 

