Re: Processing huge XML files
Thomas Lee <ytlee@c...> writes: > We need to process very large XML files (up to a few ten MB). So we > can't process the whole file with DOM in memory. We are prepared to > use SAX to parse a large XML file and store into our data structure in > the disk (propietary data structure or RDBMS). We'll also provide a > set of APIs for access the contents in the XML file. Now the XML > content change is not required. In the computational linguistics domain, we regularly work with XML documents in the 100MB .. GB range. For many of our applications, streaming processing using a hybrid pull/tree-fragment API allows very efficient processing -- see our LT XML toolkit  and a paper about its use . ht  http://www.ltg.ed.ac.uk/software/xml/  http://www.ltg.ed.ac.uk/~dmck/Papers/chum.ps -- Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh W3C Fellow 1999--2002, part-time member of W3C Team 2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440 Fax: (44) 131 650-4587, e-mail: ht@c... URL: http://www.ltg.ed.ac.uk/~ht/ [mail really from me _always_ has this .sig -- mail without it is forged spam]
PURCHASE STYLUS STUDIO ONLINE TODAY!
Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!
Download The World's Best XML IDE!
Accelerate XML development with our award-winning XML IDE - Download a free trial today!
Subscribe in XML format