[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: The triples datamodel -- was Re: SemanticWeb per


extract xml triples from language
Elliotte Rusty Harold wrote:
> At 1:11 PM +0200 6/5/04, Bjoern Hoehrmann wrote:

[a given XPath being useful to extract a certain thing from a certain 
URL's response to GET]

>> How do you know?
> 
> View source.

That's the thing. Viewing the source once (or, indeed, N times) and 
seeing a pattern (today's stories being at //html:today) and assuming it 
will work in future is, indeed, a rather informal kind of schema.

At least, by my definition of schema as learnt from the database world, 
which is something like "a convention on how a given abstract piece of 
information is represented" - in this case, I'm not talking about schema 
in the sense perhaps more normally found in XML, as a "validity constraint".

As well as that XPath, there's probably more to the informal schema 
being used here - unless the software that uses that XPath to extract 
today's news is a totally generic XSLT/CSS/etc supporting XML browser, 
then there's probably also an assumption that it's in XHTML, and that 
it's human-readable text in some human language as well (perhaps even an 
assumption of it being a specific dialect of English).

Information about the structure of a site gleaned from viewing the 
source may be subject to random change; if the site published a schema 
(be it a formal machine-readable schema or a paragraph of text like 
above), they would then have the opportunity to also state how far users 
can rely on that not changing in future. They may lie, of course, but 
people will have more cause to complain if they "said" they wouldn't 
change it; so when some software that relies on it breaks, the author of 
the software can say "Hey! The news site broke its promise" rather than 
"Uh, I made an assumption that no longer holds"...

ABS


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2011 All Rights Reserved.