RE: Searching huge xml-documents
This will get you there and supply you with some added value: http://metalab.unc.edu/xql/ > -----Original Message----- > From: owner-xsl-list@xxxxxxxxxxxxxxxx > [mailto:owner-xsl-list@xxxxxxxxxxxxxxxx]On Behalf Of Rick Geimer > Sent: Wednesday, April 14, 1999 8:05 PM > To: xsl-list@xxxxxxxxxxxxxxxx > Subject: Re: Searching huge xml-documents > > > Could you post the URL, if there is one. I would be very > interested in being > able to store and query a DOM on secondary storage. > > Rick Geimer > National Semiconductor > rick.geimer@xxxxxxx > > Ed Nixon wrote: > > > There was a posting on Robin Cover's XML News site last > week or the week > > before about an implementation of XQL from folks in > Darmstadt. They have > > implemented an 'compliation' mechanism that takes the DOM > tree, indexes it > > and writes to disk. At that point it's possible to run XQL > against this > > file, either in memory or cached to disk. > > > > Perhaps this would be worth a look? > ...edN > > > > > -----Original Message----- > > > From: owner-xsl-list@xxxxxxxxxxxxxxxx > > > [mailto:owner-xsl-list@xxxxxxxxxxxxxxxx]On Behalf Of Thomas Weholt > > > Sent: Wednesday, April 14, 1999 6:54 AM > > > To: xsl-list@xxxxxxxxxxxxxxxx > > > Subject: Searching huge xml-documents > > > > > > > > > hi, > > > > > > I was thinking using XML as fileformat for a CD database. > Each cd can > > > containg approx. 20000 files, like clipart or source code, and > > > I need a fast ( I don`t mind waiting for 5-10 secs. to search > > > 300-500 cds ( > > > max 20000 entries pr. cd )) method to pick out items > > > according to a given > > > index. > > > > > > Something like > > > > > > <cd_doc> > > > ... info about the cd ... > > > <entries> > > > <entry no="1" path="/cdrom/stugg/long path/more > > > text/python_stuff.tar.gz" ... more info .../> > > > ... 199999 or so more entries > > > </entries> > > > </cd_doc> > > > > > > I want to search by entry no or sort by any other > attribute. Perhaps > > > genererate an word-index to speed-up the process. The reason > > > I want to use > > > XML is that I use java, perl, python and other programming > > > languages on > > > several platforms. XML is readable for humans and easy to put > > > on the web. > > > My main consern is speed. Storagespace is not an issue. How > > > fast is XSL? > > > How fast is available Java packages? Any thoughts? > > > > > > I don`t even know if this is "doable", like generating > > > indexes etc., but > > > would like to use xml at least for learning purposes. > > > > > > As an experiment I created a xml-document with the > structure above, > > > containing 90000 entries and searched for a given entry no, > > > using Xt and a > > > simple xsl-stylesheet. The result was a little slow. Has > > > anybody tested the > > > IBM java tools for searching, not generating html, but just > > > looking up a > > > given element in a huge document, the result ( in time ) would be > > > interesting. Xt probably has some overhead due to the fact > > > it`s written in > > > java -> starting VM and so on. If a java-app is allready > > > running, how fast > > > can I locate several elements in a given xml-document? > > > > > > > > > > > > > > > > > > ---------------------------------------------- > > > Thomas Weholt > > > eMail : weholt@xxxxxxxxxxxxxx > > > HTTP://www.linuxfreak.com/~weholt > > > Phone : +47 - 92 09 59 68 > > > ---------------------------------------------- > > > > > > > > > > > > XSL-List info and archive: > http://www.mulberrytech.com/xsl/xsl-list > > > > > > > XSL-List info and archive: > http://www.mulberrytech.com/xsl/xsl-list > > > XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list > XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
PURCHASE STYLUS STUDIO ONLINE TODAY!
Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!
Download The World's Best XML IDE!
Accelerate XML development with our award-winning XML IDE - Download a free trial today!
Subscribe in XML format