[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: standard compressed XML format?
Have you look at xMill ? Url : ----- http://www.seas.upenn.edu/~liefke/xmill/xmill.html Abstract from the page : ------------------------ An Efficient Compressor for XML (Mirror from AT&T Labs-Research) XMill is a new tool for compressing XML data efficiently. It is based on a regrouping strategy that leverages the effect of highly-efficient compression techniques in compressors such as gzip. XMill groups XML text strings with respect to their meaning and exploits similarities between those text strings for compression. Hence, XMill typically achieves much better compression rates than conventional compressors such as gzip. XML files are typically much larger than the same data represented in some reasonably efficient domain-specific data format. One of the most intriguing results of XMill is that the conversion of proprietary data formats into XML will in fact improve the compression - i.e. the the compressed XML file is (up to twice) smaller than the compressed original file! And this astonishing compression improvement is achieved at about the same compression speed. Regards, Sebastien >>>>>>>>>>>>>>>>>> Original Message <<<<<<<<<<<<<<<<<< On 3/16/00, 6:33:59 PM, "Simon St.Laurent" <simonstl@s...> wrote regarding standard compressed XML format?: > Is anyone doing any work on a standard compression format for XML > documents? > I'm starting to get concerned about the volume of complaints I'm getting > from readers and folks in Web development forums who are starting to > argue > that XML's verbosity is a problem, especially for things like > transmitting > vector graphics information. There are a lot of wasted bits in XML > documents - and of course in HTML and other text documents as well. > I'm not happy about the prospect of sending documents to browsers as > .zip > or some other compressed format and making users go through multiple > steps > to decompress and view the content. I'd like to think that we could > come > up with a compression/decompression algorithm for markup (maybe just > XML, > maybe all text) that we can use transparently. Ideally, it would be an > algorithm explicitly placed in the public domain, avoiding licensing and > legal battles. > Some folks have argued that this belongs in transfer protocols, while > others have argued that it should be a 3rd party function, like .zip and > .sit are today. I'm not convinced by the first because so many > competing > formats (gif, jpeg, flash, etc.) already include compression, and I'm > not > convinced by the second because I don't think users are willing to > micromanage such a process. > It also has an impact on some of the discussions on the IETF-XML-MIME > discussion (see http://www.imc.org/ietf-xml-mime/ for archives and > information) because we're already discussing how best to mark > information > as XML for possible generic processing. If a compression standard > emerged, > it might well have an impact on MIME types - and I'd like to see that > discussion start before we settle the MIME types for XML debate. > Any thoughts? I like the fact that XML is verbose when I'm editing and > processing, but it's not so good in transmission. I'd like to think > that > there's a good _general_ solution that will let us have the best of both > worlds. > Simon St.Laurent > XML Elements of Style / XML: A Primer, 2nd Ed. > Building XML Applications > Inside XML DTDs: Scientific and Technical > Cookies / Sharing Bandwidth > http://www.simonstl.com > ************************************************************************ > *** > This is xml-dev, the mailing list for XML developers. > To unsubscribe, mailto:majordomo@x...&BODY=unsubscribe%20xml-dev > List archives are available at http://xml.org/archives/xml-dev/ > ************************************************************************ > *** *************************************************************************** This is xml-dev, the mailing list for XML developers. To unsubscribe, mailto:majordomo@x...&BODY=unsubscribe%20xml-dev List archives are available at http://xml.org/archives/xml-dev/ ***************************************************************************
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|