[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: nextml

  • From: Uche Ogbuji <uche@ogbuji.net>
  • To: Amelia A Lewis <amyzing@talsever.com>
  • Date: Wed, 8 Dec 2010 22:18:58 -0700

Re:  nextml
On Wed, Dec 8, 2010 at 9:27 PM, Amelia A Lewis <amyzing@talsever.com> wrote:
I've
seen a number of "only UTF" comments, and I think that they're rather
western-centric, so I'm thinking "no," there (if someone whose native
language *isn't* west european proposes it, I might rethink).

Rick Jelliffe brings one of the most complete and coherent Eastern/Western perspectives I've ever encountered, and his proposal says:

"A Nuke document is UTF-8 in its external form. Inside a program, after parsing, it would typically use UTF16."

Yes, we all know about the politics and inertia that have affected uptake of Unicode in some geographies, but the "UTF-8 or UTF-16" is there for a very strong pragmatic reason.  Dealing with a pretty open-ended world of character sets, as in XML 1.0 is one of the biggest factors that complicate and slow down parsers, even if you get someone else (e.g. ICU) to do the relatively hard bits.

If we want to have a strong diversity of well-performing and conforming tools, which I suspect is an important component of success for most of us considering XML-NG, I think "UTF-*-only" is the simple reality.  For me, UTF-8 or UTF-16 is certainly an improvement over JSON's UTF-8 only.

I'm curious as to how that JSON limitation is affecting trends in text processing conventions in non-Western countries as "Web 2.0" becomes pervasive.


--
Uche Ogbuji                       http://uche.ogbuji.net
Weblog: http://copia.ogbuji.net
Poetry ed @TNB: http://www.thenervousbreakdown.com/author/uogbuji/
Founding Partner, Zepheira        http://zepheira.com
Linked-in: http://www.linkedin.com/in/ucheogbuji
Articles: http://uche.ogbuji.net/tech/publications/
Friendfeed: http://friendfeed.com/uche
Twitter: http://twitter.com/uogbuji
http://www.google.com/profiles/uche.ogbuji

  • References:
    • nextml
      • From: Amelia A Lewis <amyzing@talsever.com>

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.