[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Is HTML structured or unstructured information?


structured and unstructured information
Bullard, Claude L (Len) wrote:
> ...
> HTML is the example many think they understand. 
> HTML is not just a presentational vocabulary.
> META tags, for example, are not presentational.
> FORM tags aren't strictly presentational.  Even 
> DIVs aren't strictly presentational.  In fact, 
> almost any tag has aspects of presentation and 
> content (note I am not using the term 'semantic' 
> here because presentation is a semantic). The 
> principle 'separation of presentation and content' 
> is flaky in practice.
> ...

hi, Claude

IMHO, presentation is not semantic : semantic is used for terms that 
means something ; you will say that "<b>" and "<i>" means "bold" and 
"italic", but as a meaning of a tag applies on its content, you can't 
say that :
"my name is <b>Philippe Poulard</b>" has not the same meaning that :
"my name is Philippe Poulard" ;
thus, "<b>" and "<i>" are semantically transparent

<b> has no semantic, it is just a stylistic information for a formatter
<i> has no semantic, it is just a stylistic information for a formatter
<title> has a semantic, and a formatter also use it as a stylistic 
information, but other tools (indexer) may use it with more importance 
than the rest (that is almost consider as plain-text)

(X)HTML is 90% non-semantic

Docbook defines more abstract presentation stuff that are not semantic, 
but it also defines more semantic structures (for example, things 
related to authors, etc)

(X)HTML is structured as well as RDBMS

A table in an RDBMS is structured, and usually semantic, but it is not 
an obligation : one could design a table named "paragraph" that contains 
  1 column "normal" and 1 column "bold", to express the same thing as 
above :
-----------------------------
|        paragraph          |
|---------------------------|
|normal    |bold            |
-----------------------------
|my name is|Philippe Poulard|
-----------------------------
but it is certainly a bad idea :)

the real question is not about "structured or unstructured" information, 
because by definition markup languages ARE structured, but rather about 
"semantic or not semantic" : XML as well RDBMS may structure both 
semantic and non-semantic information

-- 
Cordialement,

            ///
           (. .)
  -----ooO--(_)--Ooo-----
|   Philippe Poulard    |
  -----------------------

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.