[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Semantic Web Hackings

  • From: Dan Brickley <Daniel.Brickley@b...>
  • To: xml-dev@l...
  • Date: Fri, 29 Dec 2000 13:23:58 +0000 (GMT)

restore aol

Resend-of-bounced-mail this time from my ILRT mailbox since
(ironically enough) lists.xml.org didn't know mailto:danbri@w... and 
mailto:daniel.brickley@b... are mailboxes owned by the same
individual. Replies might want to restore www-rdf-interest@w... to
the cc: list. cheers, --dan

From danbri@w... Fri Dec 29 13:18:07 2000
Date: Fri, 29 Dec 2000 07:42:57 -0500 (EST)
From: Dan Brickley <danbri@w...>
To: Martin Bryan <mtbryan@s...>
Cc: xml-dev@l..., www-rdf-interest@w...
Subject: Re: Semantic Web Hackings

On Fri, 29 Dec 2000, Martin Bryan wrote:

>
> Sean et al
> >      http://xmlns.com/foaf/0.1/mbox
> >      A web-identifiable Internet mailbox associated with
> >      exactly one owner.
>
> But what happens if you have multiple mailboxes or share mail addresses with
> others in a virtual workgroup? I have my main address, one for the virtual
> project I am currently working on and a number that identify working groups
> that I am part of. In different roles I use different addresses to have mail
> sent. How does one know know that mtbryan@s... =
> mtbryan@d... but represents only one part of xml-dev@l...?

Using metadata ;)

In RDF terms: it's a property of a property. Some properties
(foaf:personalMailbox, foo:humanGenomeChecksum, chat:aolScreenName,
foaf:personalHomepage...) have the characteristic that they pick out
at-most-one individual. For any given value for one of these properties,
we know that there exists at-most-one identifiable resource who has
that particular value for that property. This is really useful: it
depoliticises URI space by allowing for multiple ways of identifying
(through description) the selfsame resource. So useful in fact that this
meta-property is cropping up in RDF extension vocabularies (eg. DAML, OIL,
FOAF...).

The trick is to define these properties in a way that works around the
real-world fuzziness. Not all mailboxes are personal, but most can be
considered to have a single owner. So we can define for eg.
'personalMailbox' as a relation between indviduals and their (possibly
many) mailboxes. For any given personal mailbox, there is at most one
individual whose personal mailbox it is. Same for homepage,
aol-screenname or whatever. The fact that you might have multiple
personal mailboxes is just fine. That's just more data -- you can write
down in RDF/XML that there is a person who has mailboxes danbri@w... and
daniel.brickley@b... (or you can extract same from my PGP key,
acquiring info about my public key at same time). The fact that some
mailboxes (eg. xml-dev@l...) are not anybody's personal mailbox
is fine. The fact that some are shared is fine too, either we consider
them to have a primary owner, or we don't. We're in the business of making
a cartoon characture of the world here, not representing it in full
nitpicky detail.

Point of all this being that we shouldn't conflate individuals with their
various online representations and activities, but that we can
nevertheless use properties of the latter to indirectly identify the
former. People being just one (political) example; audio content
(MP3s/CDs) are another equally political case to consider. Compare the
flack you'd get for attempting to create a single URI scheme for
representing people, or MP3'd songs with the ease of using arbitrary
existing properties (screennames, mailboxes, tracklist-checksums etc).
Identification through description seems to be the way to go, IMHO.


(A slight aside...)

We claim RDF is good at merging data from multiple sources; in my
experience this is true. The current discussion suggests a crude taxonomy
of RDF data aggregation mechanisms:

(1) out-out-of-the-box aggregation ("naive graph merge")
    All RDF systems do this, by virtue of using URIs for identifiers
    to merge data from multiple sources.

(2) 2nd pass node convergence ("data smushing")
    As discussed above, strategies that merge together RDF from multiple
    sources in such a way as to figure out (in some cases) where
    anonymously-mentioned resources are descriptions of the same thing.

(3) Fancy Semantic Web inference stuff ("don't hold your breath...")
    As above but drawing additional conclusions based on complex rules
    and re-application of (2).


>From where I'm standing, (1) seems really handy, (2) is critical to
deploying this stuff in the grubby real world where things don't have
URIs, and (3) is, er, something to keep an eye on.

My working hypothesis (FOAF etc., more on which another time) is that (2),
ie. basic techniques for folding RDF data together even when URIs are
scarse, is enough to build something pretty cool.  Sure we have to make
some simplifying assumptions, but then that's what the Web's all about...

Dan


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.