[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Searching huge xml-documents

Subject: Re: Searching huge xml-documents
From: Rick Geimer <rick.geimer@xxxxxxx>
Date: Wed, 14 Apr 1999 17:04:48 -0700
fast geimer
Could you post the URL, if there is one. I would be very interested in being
able to store and query a DOM on secondary storage.

Rick Geimer
National Semiconductor
rick.geimer@xxxxxxx

Ed Nixon wrote:

> There was a posting on Robin Cover's XML News site last week or the week
> before about an implementation of XQL from folks in Darmstadt. They have
> implemented an 'compliation' mechanism that takes the DOM tree, indexes it
> and writes to disk. At that point it's possible to run XQL against this
> file, either in memory or cached to disk.
>
> Perhaps this would be worth a look?                             ...edN
>
> > -----Original Message-----
> > From: owner-xsl-list@xxxxxxxxxxxxxxxx
> > [mailto:owner-xsl-list@xxxxxxxxxxxxxxxx]On Behalf Of Thomas Weholt
> > Sent: Wednesday, April 14, 1999 6:54 AM
> > To: xsl-list@xxxxxxxxxxxxxxxx
> > Subject: Searching huge xml-documents
> >
> >
> > hi,
> >
> > I was thinking using XML as fileformat for a CD database. Each cd can
> > containg approx. 20000 files, like clipart or source code, and
> > I need a fast ( I don`t mind waiting for 5-10 secs. to search
> > 300-500 cds (
> > max 20000 entries pr. cd )) method to pick out items
> > according to a given
> > index.
> >
> > Something like
> >
> > <cd_doc>
> >       ... info about the cd ...
> >       <entries>
> >         <entry no="1" path="/cdrom/stugg/long path/more
> > text/python_stuff.tar.gz" ... more info .../>
> >         ... 199999 or so more entries
> >       </entries>
> > </cd_doc>
> >
> > I want to search by entry no or sort by any other attribute. Perhaps
> > genererate an word-index to speed-up the process. The reason
> > I want to use
> > XML is that I use java, perl, python and other programming
> > languages on
> > several platforms. XML is readable for humans and easy to put
> > on the web.
> > My main consern is speed. Storagespace is not an issue. How
> > fast is XSL?
> > How fast is available Java packages? Any thoughts?
> >
> > I don`t even know if this is "doable", like generating
> > indexes etc., but
> > would like to use xml at least for learning purposes.
> >
> > As an experiment I created a xml-document with the structure above,
> > containing 90000 entries and searched for a given entry no,
> > using Xt and a
> > simple xsl-stylesheet. The result was a little slow. Has
> > anybody tested the
> > IBM java tools for searching, not generating html, but just
> > looking up a
> > given element in a huge document, the result ( in time ) would be
> > interesting. Xt probably has some overhead due to the fact
> > it`s written in
> > java -> starting VM and so on. If a java-app is allready
> > running, how fast
> > can I locate several elements in a given xml-document?
> >
> >
> >
> >
> >
> > ----------------------------------------------
> >               Thomas Weholt
> >        eMail : weholt@xxxxxxxxxxxxxx
> >      HTTP://www.linuxfreak.com/~weholt
> >         Phone : +47 - 92 09 59 68
> > ----------------------------------------------
> >
> >
> >
> >  XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
> >
>
>  XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.