
RE: XML Binary and Compression

Yes, it is true that insignificant whitespace is lost with the schema-based
encoder/decoder. This behavior is similar to an XSLT processor's
canonicalization of whitespace.

Also, the lexical format of content may not be preserved when using a
schema-based encoding approach (e.g. "100" and "1.0E2").
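
To illustrate the lexical-format point, here is a minimal Python sketch. It is
not the encoder discussed in this thread. It assumes a hypothetical schema that
types an element as xs:double and an encoding that stores the value as an
8-byte IEEE 754 double; the names encode_double and decode_double are made up
for the example.

```python
import struct


def encode_double(lexical: str) -> bytes:
    """Encode the text of a (hypothetical) xs:double element as 8 bytes."""
    # float() also ignores leading and trailing whitespace, so whitespace
    # around the value is dropped at this step as well.
    return struct.pack(">d", float(lexical))


def decode_double(payload: bytes) -> str:
    """Decode 8 bytes back to text; the decoder has to pick one lexical form."""
    (value,) = struct.unpack(">d", payload)
    return repr(value)


a = encode_double("100")
b = encode_double("1.0E2")
c = encode_double("  100\n")

# All three inputs produce the same bytes, so the decoder cannot tell
# which spelling (or which surrounding whitespace) the sender used.
print(a == b == c)
print(decode_double(a))
```

The numeric value survives the round trip, but the original spelling does not:
the decoder sees identical bytes for all three inputs and can only emit
whatever form it chooses.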

I'm not suggesting that the schema-based approach is one-size-fits-all. In
fact, as shown in our experiments, it is suboptimal for large datasets. But
it has value, especially for smallish schema-valid files, and could be a part
of a broader solution to XML size optimization. By its very nature, no
optimization solution will be optimal for everyone's requirements. But a
combination of solutions may meet 80% of the cases.

- Dan

> -----Original Message-----
> From: Elliotte Rusty Harold [mailto:elharo@m...]
> Sent: Thursday, March 13, 2003 10:39 AM
> To: winkowski@m...; xml-dev@l...
> Cc: winkowski@m...; msc@m...
> Subject: RE:  XML Binary and Compression
> At 9:18 AM -0500 3/13/03, winkowski@m... wrote:
> >Hmm, I'm sorry you don't think schema-based encoding is fair. I find it odd
> >that you regard schema-based (encoding) compression as lossy. This term is
> >normally associated with a permanent loss of information. Neither ASN.1 nor
> >MPEG-7 results in the loss of XML content (the original content did not of
> >course contain the XML schema). The deployment of the schema upon which
> >encoding/decoding is based is a management issue. There is no need to
> >transmit it as part of the encoded content.
> >
> I suppose it depends on the schema based encoding. The ones I've seen 
> do things like throw away white space they don't consider to be 
> significant based on data type. That's lossy. They also normally 
> require the same schema to be present on the receiving end for 
> decompression. I couldn't tell from skimming your paper whether that 
> happened in your data or not. At first I thought it didn't, but what 
> you posted here later indicated that maybe it did. Can you clarify?
> -- 
> +-----------------------+------------------------+-------------------+
> | Elliotte Rusty Harold | elharo@m... | Writer/Programmer |
> +-----------------------+------------------------+-------------------+
> |           Processing XML with Java (Addison-Wesley, 2002)          |
> |              http://www.cafeconleche.org/books/xmljava             |
> | http://www.amazon.com/exec/obidos/ISBN%3D0201771861/cafeaulaitA  |
> +----------------------------------+---------------------------------+
> |  Read Cafe au Lait for Java News:  http://www.cafeaulait.org/      |
> |  Read Cafe con Leche for XML News: http://www.cafeconleche.org/    |
> +----------------------------------+---------------------------------+

