[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: What is Data?

  • From: "Seth Johnson" <seth.johnson@realmeasures.dyndns.org>
  • To: "Costello, Roger L." <costello@mitre.org>, "'xml-dev@l...'"<xml-dev@l...>
  • Date: Mon, 31 Aug 2009 11:10:58 -0400

Re:  What is Data?


-----Original Message-----
From: "Costello, Roger L." <costello@mitre.org>
To: "'xml-dev@lists.xml.org'" <xml-dev@lists.xml.org>
Date: Mon, 31 Aug 2009 08:23:38 -0400
Subject:  What is Data?

> 
> Hi Folks,
> 
> Below is a definition of data, based on our recent discussions.
> I ask for your comments on these aspects:
>
> The following description of a book is not data, although it
> contains data: 
> 
>     In this groundbreaking book, evolutionary
>     biologist Jared Diamond stunningly dismantles
>     racially biased theories of human history by
>     revealing the environmental factors actually
>     responsible for history's broadcast patterns.


If you linked all the elements together with ID keys, you could
model the parsing of this like o:


Subject           Predicate       Object
Jared Diamond     action/verb     dismantles
Jared Diamond     dismantle act   theories
Jared Diamond     specialty       evolutionary biologist
Jared Diamond     preposition     in this groundbreaking
  dismantles                      book
Jared Diamond     preposition     in this groundbreaking
  dismantles                      book
  theories
Jared Diamond     preposition     in this groundbreaking
  dismantles                      book
  racially biased
  theories
book              adjective       groundbreaking
book              (not sure what) this
theories          adjective       biased
theories          preposition     of human history
dismantle act     adverb          stunningly
biased mode       adverb          racially
Jared Diamond     adverbial       by revealing
  dismantles      phrase?         the environmental factors
                                  actually responsible for
                                  history's broadcast
                                  patterns
Jared Diamond     adverbial       by revealing
  dismantles      phrase?         the environmental factors
  theories                        actually responsible for
                                  history's broadcast
                                  patterns
revealing         object          factors

and so on.

Just think of a data model for parsing language.


Seth



> Here is some of the data:
> 


> Here is some of the data:
> 
> There is an entity:
>     -	book
> 
> It has an attribute:
>     -	innovativeness: groundbreaking
> 
> There is an entity:
>     -	evolutionary biologist
> 
> It has attribute:
>     -	name: Jared Diamond
> 
> It has a relationship:
>     -	this entity is the author of the book entity
> 
> And so forth.
> 
> This example shows that text can be mined for data. 
> 
> 
> ANOTHER EXAMPLE
> 
> This is not data and it contains no data:
> 
>     Run really fast.
> 
> The sentence contains a verb followed by an adverb followed by
> an adjective. Verbs, adverbs, and adjectives are not data.
> 
> Data are nouns.
> 
> 
> xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
>         Simplification
> xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
> 
> Recent research suggests that there may be just two categories
> of data:
>     1. Entities
>     2. Relationships
> 
> An attribute is merely a special case of a relationship.
> 
> 
> EXAMPLE
> 
> Above we stated that these represent an entity, attribute, and
> relationship, respectively: 
> 
> John Smith
> Six feet tall
> Father of
> 
> Rather than considering "Six feet tall" as an attribute of
> entity "John Smith", we can consider "Six" to be an entity and
> there is a relationship (has a height of) between "John Smith"
> and "Six":
> 
> John Smith has a height of Six
> 
> Thus, in this example there are two entities ("John Smith" and
> "Six") and two relationships ("has a height of" and "Father
> of")
> 
> 
> xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
>         Data and Datum
> xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
> 
> Data is the plural of datum, a singular item. In practice,
> however, people use data as both the singular and plural form
> of the word.
> _______________________________________________________________
> ________
> 
> XML-DEV is a publicly archived, unmoderated list hosted by
> OASIS
> to support XML implementation and development. To minimize
> spam in the archives, you must subscribe before posting.
> 
> [Un]Subscribe/change address:
> http://www.oasis-open.org/mlmanage/
> Or unsubscribe: xml-dev-unsubscribe@lists.xml.org
> subscribe: xml-dev-subscribe@lists.xml.org
> List archive: http://lists.xml.org/archives/xml-dev/
> List Guidelines:
> http://www.oasis-open.org/maillists/guidelines.php


  • Follow-Ups:
  • References:

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2011 All Rights Reserved.