[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Xqueeze: Compact XML Alternative


linux compact xml
Alaric B. Snell wrote:

>>* Competition with compression: xqML in it's current format is as
>>  structured as XML so it too compresses well. In an experiment[1] a
>>  12 kB HTML document zipped to 2 kB. The (handwritten) BE for the
>>  same document took 3 kB and when zipped, it took less than 1000
>>  bytes.
>>    
>>
>
>Mmmm, it bugs me when people compare gzipped XML with $binary_format. They 
>should compare XML with $binary_format and gzipped XML with gzipepd 
>$binary_format. gzipped $binary_format will, in general, be the smallest of 
>them all, and yet faster to read/write than gzipped XML.
>
It doesn't necessarily (or even generally) work that way - compact 
binary formats don't generally compress down as well as text, so you end 
up with size(text) > size(binary) > size(compressed-binary) > 
size(compressed-text). That seemed to be the case with my XMLS format 
(http://www.sosnoski.com/opensrc/xmls - still on hold, though I hope to 
get back to it soon). One of the oddities of how compression works... 
David Mertz has done some research in this area - see his article at 
http://www-106.ibm.com/developerworks/library/x-matters13.html Also see 
James Cheney's paper "Compressing XML with Mulitplexed Hierarchical PPM 
Models" at http://www.cs.cornell.edu/People/jcheney/xmlppm/paper/paper.html

Try a range of documents and see how the compression works out before 
making any claims. For compression of XML text bzip2 looks like the best 
choice from what I've seen, so that should probably be the basis for 
comparison.

  - Dennis


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.