
Re: The Rising Sun: How XML Binary Restored the Fortunes of I


Elliotte Rusty Harold wrote:

> Alessandro Triglia wrote:
>
>> True, with XML 1.0 you can use any Unicode viewer (or any EBCDIC viewer, or
>> any SHIFT_JIS viewer, or any xyz viewer, etc., depending on the
>> circumstances) -- you don't have to use a specific program like the MS XML
>> 1.0 viewer that is built into IE.  But still, if FI viewers became
>> ubiquitous, what would be the fundamental reason for concluding that FI does
>> not comply with the "view source" paradigm?
>
> In the short term (by which I mean a few years, maybe even a few 
> decades) there's probably not a lot of difference. In the long term, 
> i.e. centuries or more, the difference might become significant. Many 
> of the NOT XML formats are much harder to decode without pre-existing 
> knowledge of the format or even the specific schemas used to encode 
> the information. Whether this is true of the FI version of NOT XML or 
> not, I don't know. The real question is whether the full information 
> content of the document is present in each instance. The level of 
> redundancy also matters. Compression is the enemy of robustness.
>
A) The "Self Contained" property, which will have to be one of several 
modes of operation, captures having "full information content of the 
document ... present in each instance".  A format that has externalized 
structure, metadata, and redundancy of the data can be made 
self-contained if it is paired with an interpretable version of the 
externalized information.  (An important distinction is how much of this 
is interpretable metadata and how much is embedded in code.)  I have a 
strong preference that all externalized information be able to be 
represented in a standardized interpretable form.  This may need its 
own spec or be part of a "binary XML" spec.

B) Only self contained instances are suitable for archiving.  As far as 
interpreting the format, the reason we want an open standard is so that 
the spec and code are available.  I am confident that our distant 
descendants will be able to read English and compile C code.
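
To make point A concrete, here is a minimal sketch of pairing a compact instance with an interpretable copy of its externalized information. Everything in it is an assumption made for illustration: the name table, the nested-list instance layout, and the use of JSON as the "interpretable form" are hypothetical and are not taken from FI or any binary XML proposal.

```python
import json

# Hypothetical external name table: maps small integer ids to element names.
NAME_TABLE = {1: "order", 2: "customer", 3: "total"}

# A compact instance that refers to element names only by id. On its own
# it cannot be interpreted without access to NAME_TABLE.
compact_instance = [[1, [[2, "ACME"], [3, "42.50"]]]]


def make_self_contained(instance, name_table):
    """Pair the instance with an interpretable copy of its name table."""
    names = {str(name_id): name for name_id, name in name_table.items()}
    return json.dumps({"names": names, "data": instance})


def decode(bundle_text):
    """Rebuild named elements using only what is inside the bundle."""
    bundle = json.loads(bundle_text)
    names = bundle["names"]

    def expand(node):
        name_id, content = node
        if isinstance(content, list):
            content = [expand(child) for child in content]
        return (names[str(name_id)], content)

    return [expand(node) for node in bundle["data"]]


bundle = make_self_contained(compact_instance, NAME_TABLE)
decoded = decode(bundle)
```

The bundle is larger than the compact instance, which is the trade-off: it gives back some compactness in exchange for being readable without any out-of-band table.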

Robustness is important.  A self-contained format need not be much worse 
off than text-encoded XML.  You could argue that a TNLV format is 
actually a little more robust in some cases.  If you know the length of 
an element, for instance, you can deduce a little more when the end is 
missing whereas with a missing text end tag, you have less of a clue.  
The process of deduction and reconstruction might be slightly more 
difficult and more likely to be done with a program than a text editor, 
but it isn't that much of a stretch.
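
As a rough illustration of the length argument, the sketch below parses a truncated stream of length-prefixed records and reports how many bytes of the last record are missing. The layout (1-byte tag, 4-byte big-endian length, then the value) is an assumed toy format, not the encoding of any particular binary XML proposal.

```python
import struct

# Assumed record header: 1-byte tag followed by a 4-byte big-endian length.
HEADER = struct.Struct(">BI")


def encode(tag, value):
    """Serialize one record as header + raw value bytes."""
    return HEADER.pack(tag, len(value)) + value


def parse(data):
    """Return a list of (tag, value, missing) tuples.

    `missing` is the number of value bytes the header promised but the
    stream did not contain, i.e. how much was lost to truncation.
    """
    records = []
    pos = 0
    while pos + HEADER.size <= len(data):
        tag, length = HEADER.unpack_from(data, pos)
        pos += HEADER.size
        value = data[pos:pos + length]
        records.append((tag, value, length - len(value)))
        pos += length
    return records


stream = encode(1, b"hello") + encode(2, b"world!!")
damaged = stream[:-3]          # simulate a lost tail
recovered = parse(damaged)
```

Here the parser can state that the second record is exactly 3 bytes short, whereas a text document cut off before its end tag only shows that the tag never arrived.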

sdw

-- 
swilliams@h... http://www.hpti.com Per: sdw@l... http://sdw.st
Stephen D. Williams 703-724-0118W 703-995-0407Fax 20147-4622 AIM: sdw

