
RE: Data mining the semantic web? (was RE: Semantic


Len, I agree with what you say below.

To be clear, RDF is not _itself_ an ontology language. 
But one can develop an ontology language e.g. DAML or 
OWL which is based upon RDF, i.e. is written in RDF 
syntax (triples) and at the same time XML (as a dialect 
of RDF/XML).

At the same time, via XML datatypes, one can speak about 
(e.g. make assertions about, or classifications of) 
pieces of structured data in OWL.

In RDF you can assign a type (i.e. place into a 
classification) with an "rdf:type" arc e.g.

<http://example.org/foo> rdf:type owl:Class .
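As a rough illustration of the rdf:type arc above, a triple can be modelled as a plain (subject, predicate, object) tuple. This Python sketch is not tied to any RDF library; the helper name `classes_of` is purely illustrative, and the prefixes are expanded to the standard RDF and OWL namespace URIs.

```python
# Namespace URIs for the standard RDF and OWL vocabularies.
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
OWL = "http://www.w3.org/2002/07/owl#"

# The statement from the message, as a (subject, predicate, object) tuple.
triples = {
    ("http://example.org/foo", RDF + "type", OWL + "Class"),
}

def classes_of(subject, graph):
    """Return every class the subject is placed in via an rdf:type arc.

    Hypothetical helper: it only follows direct rdf:type arcs and does
    no subclass reasoning.
    """
    return {o for (s, p, o) in graph if s == subject and p == RDF + "type"}

foo_classes = classes_of("http://example.org/foo", triples)
```

Here `foo_classes` holds the single class URI asserted for the resource; a subject with no rdf:type arc yields an empty set.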

Now, assuming that you have an XML Schema type, you might 
integrate it with OWL like so:

my:date-format rdf:type owl:DatatypeProperty .

so the idea that a fragment of XML conforms to a certain 
XML Schema _type_ is the same as saying that such a 
fragment belongs to a corresponding OWL _Class_.
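To make that correspondence a little more concrete, here is a minimal Python sketch that checks whether the text of an XML fragment fits a simplified xs:date lexical form and, if it does, records an rdf:type triple for the fragment. The class URI `http://example.org/my#Date`, the fragment identifiers and the function names are all hypothetical, and the check ignores time zones, so it is a stand-in for a real XML Schema validator rather than a replacement for one.

```python
import re
import xml.etree.ElementTree as ET
from datetime import date

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
# Hypothetical class standing in for the XML Schema type.
MY_DATE = "http://example.org/my#Date"

_DATE_RE = re.compile(r"(\d{4})-(\d{2})-(\d{2})")

def is_simple_xs_date(text):
    """Simplified xs:date check: YYYY-MM-DD, no time zone, valid calendar date."""
    match = _DATE_RE.fullmatch(text.strip())
    if not match:
        return False
    try:
        date(*map(int, match.groups()))
    except ValueError:
        # Well-formed digits but not a real date, e.g. 2003-02-30.
        return False
    return True

def classify_fragment(fragment_id, xml_text):
    """Return rdf:type triples for a fragment whose content fits the type."""
    element = ET.fromstring(xml_text)
    if is_simple_xs_date(element.text or ""):
        return [(fragment_id, RDF_TYPE, MY_DATE)]
    return []

typed = classify_fragment("http://example.org/doc#d1", "<date>2003-06-01</date>")
untyped = classify_fragment("http://example.org/doc#d2", "<date>June 1st</date>")
```

The first fragment conforms and so is assigned to the class; the second does not, and no triple is produced for it.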

So in summary, you might assign free-form 
RDF 'individuals' to classes, and you might also assign 
fragments of markup to classes, both using OWL (which is 
the successor to DAML).

Both RDF and XML approaches to 
classifications/ontologies need to coexist on the 
semantic web.

Jonathan
> Which is why we were going to markup even before the web: to seed the 
> classification of the published information, to build "local contexts". 
> 
> Again, markup is not about identification;  it is about classification.  
> The notion was that free text data in RTF-like systems was organized and 
> labeled for presentation, not for querying or reuse.  The horror on the 
> SGMLers' faces when given HTML was not that it would be an unworkable 
> system; we knew it would go as all gencoding does, like a grassfire. 
> The horror was that it returned us to and amplified the bad old days 
> of data not being reusable or easily classifiable. 
>    
> The idea for content-modeled SGML was that preclassifying the information 
> using human intelligence (authors) would help.   HTML set everything back 
> about two decades (a success for publishers but a disaster for everything 
> else).
> 
> RDF is a better ontology language but once again, one gets that HTML 
> effect of predetermined definitions over syntax at the level of 
> the classifying terminology.   There is a definite deja vu in the semantic web. 
> This doesn't mean RDF doesn't work or isn't a more precise means of 
> classification.   It is.
> 
> What would be the effect (theoretically) of ceasing to use HTML for 
> any records of authority (assertions for data mining candidates) 
> and using XML without RDF?  One lives without asserted relationships 
> for one.  Ok, so Topic Maps are added.  What then is the role 
> of RDF?
> 
> I ask because although the answers are somewhat obvious, it is also 
> obvious that RDFing any information published is too much overhead 
> and the ROI has to be good before people will do it.  On the other 
> hand, the use of XML DTDs and schemas is already well accepted and 
> it may be preferable to focus development of engines that mine based 
> on these.
> 
> One point of the article referenced originally was that Google could
> live large using the same web we have now and not have to insist on 
> a stratified web.  Of course, some potential profits based on hoarding 
> the ontologies disappear, but that is an information ecosystem 
> catastrophe in the making anyway.
> 
> len
> 
