Re: Microsoft FUD on binary XML...

To: Elliotte Rusty Harold <elharo@m...>
Subject: Re: Microsoft FUD on binary XML...
From: Alaric B Snell <alaric@a...>
Date: Sat, 22 Nov 2003 23:37:41 +0000
Cc: Tony Graham <Tony.Graham@S...>, xml-dev@l...
In-reply-to: <p06010201bbe4835029d4@[192.168.254.4]>
References: <004201c3af99$e9944010$650aa8c0@BOBDEV> <3FBDAFDA.3010905@a...> <3FBDF386.9090307@a...> <20031121.121152.50253888.Tony.Graham@S...> <3FBE14D8.7040405@a...> <p06010201bbe4835029d4@[192.168.254.4]>
User-agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.4) Gecko/20030704 Debian/1.4-1


Elliotte Rusty Harold wrote:

> One should keep in mind that Chinese and similar languages are quite
> compressed to start with, far more so than English text is. For example,
> in UTF-8 the English word "tree" takes four bytes. The Japanese word for
> tree takes three bytes.

Good point, actually... I suppose that any language which routinely uses
more than 256 code points is quite likely to be one that uses roughly one
code point per word. So languages like Arabic, which are alphabet-based
but not very compact in UTF-8 because their characters sit at high code
points (I'm not sure how high, so I don't know whether they mainly take
2 or 3 bytes each), would be better served by an encoding that mainly
uses a shiftable window of single-byte characters, I guess.
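
The byte counts under discussion are easy to check. The following is only a rough Python sketch, not anything from the thread: `utf8_profile` and `windowed_cost` are hypothetical helper names, the Arabic sample word is my own choice, and the window model (128-code-point windows, 2 bytes per window switch) is an illustrative assumption rather than the rules of any real encoding.

```python
def utf8_profile(text: str) -> tuple[int, int]:
    """Return (number of code points, number of UTF-8 bytes) for text."""
    return len(text), len(text.encode("utf-8"))


def windowed_cost(text: str, window_size: int = 128, shift_cost: int = 2) -> int:
    """Estimate bytes for a toy shifting-window encoding (assumed model).

    Each character costs one byte while it falls inside the current
    window; moving the window to a different block costs shift_cost bytes.
    """
    cost = 0
    current_window = None
    for ch in text:
        window = ord(ch) // window_size
        if window != current_window:
            cost += shift_cost  # pay to move the window
            current_window = window
        cost += 1  # one byte for the character itself
    return cost


samples = {
    "English 'tree'": "tree",
    "Japanese 'tree'": "\u6728",
    "Arabic 'tree'": "\u0634\u062c\u0631\u0629",
}

for label, word in samples.items():
    code_points, utf8_bytes = utf8_profile(word)
    print(f"{label}: {code_points} code point(s), "
          f"{utf8_bytes} UTF-8 byte(s), "
          f"{windowed_cost(word)} byte(s) in the toy window model")
```

Under this toy model the window switch is a fixed overhead, so any saving for an alphabetic script outside ASCII only shows up as the text gets longer than a word or two.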

ABS

Follow-Ups:
- RE: Microsoft FUD on binary XML...
  - From: "Alessandro Triglia" <sandro@m...>
- Re: Microsoft FUD on binary XML...
  - From: Tim Bray <tbray@t...>
- Re: Microsoft FUD on binary XML...
  - From: John Cowan <cowan@m...>

References:
- RE: Microsoft FUD on binary XML...
  - From: "Bob Wyman" <bob@w...>
- Re: Microsoft FUD on binary XML...
  - From: Rick Jelliffe <ricko@a...>
- Re: Microsoft FUD on binary XML...
  - From: Alaric B Snell <alaric@a...>
- Re: Microsoft FUD on binary XML...
  - From: Tony Graham <Tony.Graham@S...>
- Re: Microsoft FUD on binary XML...
  - From: Alaric B Snell <alaric@a...>
- Re: Microsoft FUD on binary XML...
  - From: Elliotte Rusty Harold <elharo@m...>

