
Re: What is XML for?

  • From: David Megginson <david@m...>
  • To: "XML Developers' List" <xml-dev@i...>
  • Date: Fri, 29 Jan 1999 16:33:09 -0500 (EST)

Tim Bray writes:

 > At 03:27 PM 1/29/99 -0500, David Megginson wrote:
 > >In general, few high-speed, large-scale applications can afford
 > >repeated passes through serial text files (or even random access
 > >through reverse indices), so using XML (in the literal sense) for
 > >primary storage is impractical; there are, of course, exceptions --
 > >for example, small bits of XML can be stored as blobs in relational
 > >databases.
 > 
 > Well, I'm not sure.  Perhaps it's just because my perceptions were
 > formed by working on the 500-MB deeply recursive Oxford English
 > Dictionary text; but I think that a high-performance repository
 > that could accurately mimic the data structures observed in XML
 > would be very useful in many (not all, obviously) applications.  I 
 > think I hear both Megginson and Winer expressing doubt on that 
 > front.  I'm surprised.  -Tim

No need for surprise; in fact, I agree with Tim: a high-performance
repository would be a Very Good Thing.  

Either this thread or another, closely-related one started with a
complaint that parsing an XML file over and over again for each
request put too heavy a load on a server; there was then a suggestion
that the XML could be precompiled into memory, or even stored in an
RDBMS, at which last point a participant expressed regret that it was
no longer a pure XML solution.

My point, and (I think) Paul's, is that XML defines an external
representation for hierarchical information, not (except in very small
systems) an internal representation, but that many people are now
trying to use XML internally in inappropriate ways (such as parsing
the same static file several times each second).  My second point is
that the problem of storing, searching, and retrieving hierarchical
information long predates SGML and XML and is not XML-specific (though
it can come in an XML flavour if desired).
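
To make the "parse once, not several times each second" point
concrete, here is a minimal sketch, assuming a Python server and the
standard library's ElementTree.  The function name load_xml_cached and
the module-level cache are hypothetical, invented for this
illustration; they are not from any system discussed in the thread.
The XML file stays the external representation, and the parsed tree is
the internal one, rebuilt only when the file on disk changes.

```python
import os
import xml.etree.ElementTree as ET

# Hypothetical in-memory cache: path -> ((mtime_ns, size), parsed root element).
_cache = {}

def load_xml_cached(path):
    """Return the parsed root for `path`, reparsing only if the file changed."""
    st = os.stat(path)
    key = (st.st_mtime_ns, st.st_size)   # cheap change detector (an assumption)
    hit = _cache.get(path)
    if hit is not None and hit[0] == key:
        return hit[1]                    # unchanged file: reuse the parsed tree
    root = ET.parse(path).getroot()      # changed or first request: parse once
    _cache[path] = (key, root)
    return root
```

Each request then calls load_xml_cached("catalog.xml") (a made-up
file name) and walks the returned tree; the per-request cost for an
unchanged file is one stat() call rather than a full parse.  Callers
should treat the shared tree as read-only.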

A high-performance, XML-aware repository would be a good thing,
because it could round-trip generic XML without significant loss.  So
there.
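
One hedged way to make "without significant loss" testable is to
compare canonical forms before and after a trip through the store.
The sketch below assumes Python 3.8 or later and uses ElementTree as a
stand-in for the repository; that substitution, and the function name
round_trips, are assumptions made for illustration only.  Canonical
XML treats attribute order and empty-element syntax as insignificant,
which is one possible reading of "significant".

```python
import xml.etree.ElementTree as ET

def round_trips(xml_text):
    """True if import followed by export preserves the canonical form."""
    stored = ET.fromstring(xml_text)                     # "store" the document
    exported = ET.tostring(stored, encoding="unicode")   # "retrieve" it again
    # Compare canonical serialisations rather than raw text, so that
    # attribute order and <c/> versus <c></c> do not count as loss.
    return ET.canonicalize(xml_text) == ET.canonicalize(exported)
```

For example, round_trips('<a y="2" x="1"><b>t</b>tail<c/></a>') holds
for this stand-in store; a real repository would be exercised the same
way, with its own import and export calls in place of the two
ElementTree lines.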


All the best,


David

p.s. I followed the OED work from Waterloo very closely in the late
     1980s (Frank Tompa might remember me) while I was programming
     the search engine for the 30MB Dictionary of Old English corpus,
     an hour away in Toronto; I even went to the point of trying to
     write some of my own optimised Patricia-tree implementations
     using the OED's algorithm -- nice idea for static repositories,
     but very brittle otherwise.

-- 
David Megginson                 david@m...
           http://www.megginson.com/

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i...
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@i... the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@i... the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@i...)

