[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

RE: The triples datamodel -- was Re: Semantic Web pe

  • To: 'Elliotte Rusty Harold' <elharo@m...>, XML Developer List <xml-dev@l...>
  • Subject: RE: The triples datamodel -- was Re: Semantic Web permathread, iteration n+1
  • From: "Bullard, Claude L (Len)" <len.bullard@i...>
  • Date: Tue, 8 Jun 2004 09:15:30 -0500

what schema triples means
That's good.  Except the bit about costs going up.  Why 
would they?

A schema can be a guardian or a classification verifier. 
One might assume, rightly or wrongly that the MIME type or 
the extension or a magic number or a DOCTYPE tells 
one the class.  One might have to verify that.  

Also, this application of a schema is only one of several possible.   

I agree that forcing widespread use of a schema is a tough political 
problem but it is a trivial technical issue.  One says, "this is 
the Internet, after all" but one means "these are humans 
after all".   Humans often fail Turing tests.

len

The trick to passing a Turing test is selecting the topic 
of conversation wisely.  Eagerness is everything.

From: Elliotte Rusty Harold [mailto:elharo@m...]

>On Jun 8, 2004, at 12:31 AM, Rick Marshall wrote:
>
>>  and if the schema changes, but not the xslt, and someone suffers 
>>financial loss - tax returns fail, orders lost, etc - who pays?

Perhaps there's a technical step in the proposed system you're 
missing here. When receiving a document you first have to classify 
it. That is, you must figure out if this is a kind of document you've 
seen before, and if you have tools in place to process it 
automatically. If you do, then dispatch it to one of those tools. If 
not, dispatch it to a human for further analysis.

We can adjust how tight we make the recognition software. Personally, 
I like loose, XPath based solutions like Schematron that ask whether 
the document contains the information I want rather than asking 
whether it tightly fits some W3C XML Schema Language schema. However, 
if you want to use a conservative schema (everything not permitted is 
forbidden) as your diagnosis, go ahead. You won't be able to process 
quite as much automatically, and costs will go up; but maybe in your 
environment and for your processes safety concerns do mandate that. 
We can also have a middle ground, where XPath extracts the relevant 
fragments of a document, and then each of these fragments we use is 
validated closely without worrying about the outer envelope. And 
there are lots of other points along the continuum as well.

However, the really key idea is to use the schema, in whatever 
language, as a classification tool, not a guardian. The schema's job 
is to sort documents into the right queue, not to accept some 
documents unconditionally and reject all others.

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.