[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Closing Blueberry

  • From: Elliotte Rusty Harold <elharo@m...>
  • To: xml-dev@l...
  • Date: Thu, 19 Jul 2001 22:38:55 -0400

top
At 9:10 AM +0900 7/20/01, Murata Makoto wrote:

>As for the Japanese language, I believe that I have demonstrated
>reasons: changes of unification and made-in-Japan Kanji require
>non-BMP name characters.  If Unicode becomes popular and we 
>continue to use XML 1.0, disallowed CJK ideographics will become 
>traps.
>

I'm starting to realize there may be a deeper issue here. Languages evolve. It's the nature of the things.There are dozens of new words every year, some to describe new technologies like fax machines and e-mail, some that get adapted from other languages (glasnost in English, le weekend in French), others that just arise. ("Doh" just made it into the Oxford English Dictionary.) I'd be surprised if Japanese and Chinese are any different in this respect. 

In alphabetic languages like English and Russian, new words are no big deal. They fit right into Unicode and XML with no hassle. But what happens in ideographic languages? I know Japanese uses Katakana for some of these words. Is it all of them? How many new ideographs come into use each year? And in Chinese? I suspect it's even worse, but I would appreciate hearing from the Chinese speakers on the list.

For the sake of argument, say we had perfect knowledge and could fix XML and Unicode so that it did cover all current ideographs used today in Chinese and Japanese, what do we next year? and the year after that? and the one after that? and every year for the next ten thousand years?

Unicode's answer is that someone fills out the right forms, proves that the characters are being used, and then they're added. There's plenty enough space in Unicode to handle several hundred new characters a year for the next ten thousand years. As I think Simon originally suggested we could just tie XML to Unicode and leave it at that.

However, any fixed solution along the lines of XML 1.0 is guaranteed to fail, especially if the criterion for success is that all characters anyone wants to use but be available for use in XML names. We and our descendants will be revisiting these arguments every five years for the next few millennia. 
-- 

+-----------------------+------------------------+-------------------+
| Elliotte Rusty Harold | elharo@m... | Writer/Programmer |
+-----------------------+------------------------+-------------------+ 
|          The XML Bible, 2nd Edition (Hungry Minds, 2001)           |
|              http://www.ibiblio.org/xml/books/bible2/              |
|   http://www.amazon.com/exec/obidos/ISBN=0764547607/cafeaulaitA/   |
+----------------------------------+---------------------------------+
|  Read Cafe au Lait for Java News:  http://www.cafeaulait.org/      | 
|  Read Cafe con Leche for XML News: http://www.ibiblio.org/xml/     |
+----------------------------------+---------------------------------+

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.