[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

RE: Sweet nostalgia

  • To: "Greg Colyer" <greg-xml@e...>, <xml-dev@l...>
  • Subject: RE: Sweet nostalgia
  • From: "Hunsberger, Peter" <Peter.Hunsberger@s...>
  • Date: Wed, 21 Jan 2004 10:55:51 -0600
  • Thread-index: AcPgG6RsXOftt5R9RAeDHy0drJ5yAwAId96A
  • Thread-topic: Sweet nostalgia

sweet nostalgia
Greg Colyer <greg-xml@e...> writes:

<snip/>
> 
> This is interesting, from Sean McGrath on the diveintomark page:
> 
> > Programming languages that barf on a syntax error do so because a 
> > partial executable image is a useless thing. A partial document is 
> > *not* a useless thing. One of the cool things about XML as 
> a document 
> > format is that some of the content can be recovered even in 
> the face 
> > of error. Compare this to our binary document friends where a blown 
> > byte can render the entire content inaccessible.
> 
> I was pondering the other day in this context about the distinction 
> between text and binary, or more particularly between a 
> markup language 
> and a file format. Perhaps part of the problem is that Rec-XML is a 
> little schizophrenic <http://www.w3.org/TR/REC-xml#sec-documents>:
> 
> > A data object is an XML document if it is well-formed ...
> 
> > A textual object is a well-formed XML document if ...
> 
> The draconians know that "well-formed XML document" and "XML 
> document" 
> are identical: if it's not well-formed, it's not XML! The liberals 
> argue: then why is the superfluous term "well-formed" even 
> used? No-one 
> talks about a "well-formed JPEG file": it's either correct or 
> it isn't.

Hmm, I don't think I buy this argument.  This gets a back to Len's
question about "noise" and my response about error correction.  While it
is true that XML carries with it some extra information that makes it
somewhat more robust in the face of corruption it isn't true that this
extra information allows you to understand what the errors where. (Nor
is it true that all binary formats are useless in when a byte is
"blown", many of them carry enough information that errors are
recoverable.)

The fact is, the extra information in XML isn't of a form that allows
you to do robust error recovery.  It might allow you to decide that you
understand some portion of the document, but if a document is corrupt do
you really want to assume that a missing end tag is the only problem and
that the real problem isn't that an entire paragraph including the
missing end tag got deleted? 
 
<snip/>


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.