[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: standard compressed XML format?

  • From: Sebastien Sahuc <ssahuc@i...>
  • To: "Simon St.Laurent" <simonstl@s...>
  • Date: Thu, 16 Mar 2000 20:13:41 GMT

compressed xml
Have you look at xMill ?

Url : 
-----
http://www.seas.upenn.edu/~liefke/xmill/xmill.html	

Abstract from the page :
------------------------
An Efficient Compressor for XML
(Mirror from AT&T Labs-Research)



XMill is a new tool for compressing XML data efficiently. It is based 
on a
regrouping strategy that leverages the effect of highly-efficient
compression techniques in compressors such as gzip. XMill groups XML 
text
strings with respect to their meaning and exploits similarities 
between
those text strings for compression. Hence, XMill typically achieves 
much
better compression rates than conventional compressors such as gzip.

XML files are typically much larger than the same data represented in 
some
reasonably efficient domain-specific data format. One of the most 
intriguing
results of XMill is that the conversion of proprietary data formats 
into XML
will in fact improve the compression - i.e. the the compressed XML 
file is
(up to twice) smaller than the compressed original file! And this
astonishing compression improvement is achieved at about the same
compression speed.


Regards, 

Sebastien

>>>>>>>>>>>>>>>>>> Original Message <<<<<<<<<<<<<<<<<<

On 3/16/00, 6:33:59 PM, "Simon St.Laurent" <simonstl@s...> wrote 
regarding standard compressed XML format?:


> Is anyone doing any work on a standard compression format for XML
> documents?

> I'm starting to get concerned about the volume of complaints I'm 
getting
> from readers and folks in Web development forums who are starting to
> argue
> that XML's verbosity is a problem, especially for things like
> transmitting
> vector graphics information.  There are a lot of wasted bits in XML
> documents - and of course in HTML and other text documents as well.

> I'm not happy about the prospect of sending documents to browsers as
> .zip
> or some other compressed format and making users go through multiple
> steps
> to decompress and view the content.  I'd like to think that we could
> come
> up with a compression/decompression algorithm for markup (maybe just
> XML,
> maybe all text) that we can use transparently.  Ideally, it would be 
an
> algorithm explicitly placed in the public domain, avoiding licensing 
and
> legal battles.

> Some folks have argued that this belongs in transfer protocols, while
> others have argued that it should be a 3rd party function, like .zip 
and
> .sit are today.  I'm not convinced by the first because so many
> competing
> formats (gif, jpeg, flash, etc.) already include compression, and I'm
> not
> convinced by the second because I don't think users are willing to
> micromanage such a process.

> It also has an impact on some of the discussions on the IETF-XML-MIME
> discussion (see http://www.imc.org/ietf-xml-mime/ for archives and
> information) because we're already discussing how best to mark
> information
> as XML for possible generic processing.  If a compression standard
> emerged,
> it might well have an impact on MIME types - and I'd like to see that
> discussion start before we settle the MIME types for XML debate.

> Any thoughts?  I like the fact that XML is verbose when I'm editing 
and
> processing, but it's not so good in transmission.  I'd like to think
> that
> there's a good _general_ solution that will let us have the best of 
both
> worlds.

> Simon St.Laurent
> XML Elements of Style / XML: A Primer, 2nd Ed.
> Building XML Applications
> Inside XML DTDs: Scientific and Technical
> Cookies / Sharing Bandwidth
> http://www.simonstl.com

> 
************************************************************************
> ***
> This is xml-dev, the mailing list for XML developers.
> To unsubscribe, mailto:majordomo@x...&BODY=unsubscribe%20xml-dev
> List archives are available at http://xml.org/archives/xml-dev/
> 
************************************************************************
> ***




***************************************************************************
This is xml-dev, the mailing list for XML developers.
To unsubscribe, mailto:majordomo@x...&BODY=unsubscribe%20xml-dev
List archives are available at http://xml.org/archives/xml-dev/
***************************************************************************

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.