[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Will XML change the character of W3C?

  • From: Rick JELLIFFE <ricko@g...>
  • Date: Thu, 19 Oct 2000 03:51:11 +0800

Re: Will XML change the character of W3C?
Sean McGrath wrote:

> 2) HTML is not not now and never was, an SGML application. Ever try
> feeding a general entity reference to a web browser? Ever try getting
> a html editor to create a CDATA section? HTML browsers skip
> over tags they don't recognise. Where in the SGML standard does
> it say that you can do that?

SGML allows a default ENTITY value. This could reference a PI which
reproduces the text of the entity reference. Or document an
error-recovery strategy an additional requirements document referenced
from the SEEALSO parameter.

If you want marked section delimiters to be ignored, map the <![
delimiter (MSS) to something else. 

But why would anyone want to?  One of the points of SGML is to allow
rigourous description of a language so that generic tools can be used on
archived data.
But after several years of tidy, Dave Ragget has still not captured the
syntax of HTML as actually practised.  The fact is that there are many
HTMLs, and some of these are more amenable to be treated as SGML than
others.

> 3) XML is an SGML application only after you change SGML a bit.
> Many - not all - but many SGML tools that predate this, um, adaption
> of the SGML standard will not process XML correctly. (Most software
> based on James Clarks awesome SP engine will. Most of the rest
> won't.)

Not every SGML tool has to be able to process every SGML document. 

ISO 8879 does not mandate any error-recovery policy. That most parser
were Draconian was because that was their default behaviour: you need to
tailor them to the particular DTD or syntax to get nice error-recovery. 
I have always thought that HTML was as much an error-recovery strategy
as a DTD. (And if that is true, then XHTML really misses the point!)  
 
> Are XML and HTML proper subsets of SGML
> in any meaningful sense?
> 
> Not in this universe.

Not very convincing.

XML is designed to be a proper subset of WebSGML; that it defines things
in addition to SGML is irrelevant. One can make WebSGML documents that
are HTML, and one can describe the extra features of most HTMLs in terms
of error-recovery or additional requirements using the mechanisms
WebSGML provides. But Sean is right that HTML is not SGML, but because
HTML as practised is a class of languages not a single language rather
than no HTML is SGML.


Rick Jelliffe

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.