[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Content v. attribute

  • From: robin@i...
  • To: xml-dev@i...
  • Date: Mon, 26 Oct 1998 08:46:04 -0600

robin vrijland
Re:
----------
So far, I've come up with this structure:
<METADATA
     TITLE="Title of the data"
     AUTHOR="J.J. Vrijland"
     DATE="26 October 1998">
</METADATA>
----------

I can think of several reasons why it's probably not
a good idea to try to model the "title" of an authored
work as an XML (SGML) attribute, given that the datatype
of XML's "attribute" is just (flat) 'string'.  In particular,
machine processing of "title" information should be
sensitive to the languages present in a work title, which is
most easily given in a "language" attribute.  So,
for example, think about the markup for these titles:

Comentarios al "Mein kampf"
Eclaircissements sur Mein Kampf: la doctrine d'Adolf Hitler
Hitler's Mein Kampf in Britain and America. A publishing history, 1930-39

These are "real titles" and, depending upon the language of
discourse in the broader work which references these works,
one would need two or three levels of nesting to capture the
fact that 'Mein kampf' is a title, in German, and that the
embedding string is a title, in some other language, and that
the larger discourse unit is XXX, in some third language.  This
is not a rare event - multilingualism "happens" in the
real world millions of time a day.

In my personal judgment, it also makes no sense to model
a "title" of a work (a document, a "chapter," a "section"
or whatever) as "metadata."  By whose definition of
"metadata"?  It's pretty difficult to find a definition
of metadata that will hold up, especially in terms of
articulating diagnostic/distinguishing features of
"metadata" vis-a-vis "data" which determine the best
modeling construct in SGML/XML.  Whether you want the
information to be "presented in the view" should not
be relevant, since XML encoding should be free of
assumptions about processing-level semantics.  Stylesheet
contols should dictate wheher/how some information is
presented or suppressed in a particular view.  Whether
the (rest of) "content" *could be understood* without
reference to the candidate information in question is
equally unhelpful: a novel is "understandable" without
the volume title and chapter titles, and it would be
understandable with every 12th word removed (if a bit
rough in places) as well.

To the extent that consciously introducing a distinction
between "data" and "metadata" into document encoding
means hard-coding a perspective, it may be a bad idea.
We already have encough problems dealing with the fact
that a particular (privileged) hierarchical modeling
of a problem domain introduces a certain distortion
(selected analytical perspectve) of the problem domain.
Neutral encoding in search of data independence would
try to eliminate these particularized interpretations
from the encoding model.

And finally, to wit (credits Steve DeRose): "your 'metadata'
is always someone else's 'data.'  I would add: what you
think is 'metadata' today will be your 'data'
tomorrow - you'll probably be sorry that you modeled the
distinction in markup.

Just my 2 cents... many will disagree, of course.

rcc

xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i...
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@i... the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@i... the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@i...)


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.