
Re: [Summary] UTF-8 Question: e with acute accent shouldrequi

  • From: Bill Kearney <wkearney99@h...>
  • To: Alessandro Triglia <sandro@m...>, xml-dev@l...,costello@m...
  • Date: Sat, 29 Sep 2007 16:42:03 -0400

Alessandro Triglia wrote:
> The whole discussion is about Unicode and ASCII!  It started with the
> following sentence in Roger's document:  "Here is a simple XML document.
> Most of its characters are ASCII, but there is one non-ASCII character, the
> é character"

No, it's not about ASCII.  If it were then the accented character would 
never have come up, as it CANNOT BE REPRESENTED IN ASCII.  Not at all.  

I haven't minutely examined the entire thread of messages, but I believe 
it was YOU, not Roger's original post, that brought up this whole ASCII 
nonsense. 

The accented characters *CAN* be represented in ISO-8859 (most of them, 
anyway) and that's probably what his underlying OS and tools are assuming.

So I think the answer to Roger's question depends on what encoding he 
thinks he's using, and what the tools think he's using.  It would appear 
to me that his example was from a document using ISO-8859, not actually 
UTF-8.  If that were the case, then seeing that accented character as 
the single byte E9 would be completely correct.  But if he WANTED to be 
using UTF-8 then a lone E9 would not be correct; UTF-8 encodes that 
character as the two bytes C3 A9. 
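
To make the byte-level difference concrete, here is a minimal Python 3 
sketch.  It is not taken from Roger's document or tools; the names 
E_ACUTE, decodes_as_utf8 and encodable_in_ascii are made up for 
illustration, and ISO-8859-1 is picked only as one example variant.

```python
# é is U+00E9 LATIN SMALL LETTER E WITH ACUTE.
E_ACUTE = "\u00e9"

# The same character, serialized under two different encodings.
latin1_bytes = E_ACUTE.encode("iso-8859-1")  # one byte
utf8_bytes = E_ACUTE.encode("utf-8")         # two bytes

print(latin1_bytes.hex())  # e9
print(utf8_bytes.hex())    # c3a9


def decodes_as_utf8(data: bytes) -> bool:
    """Return True if data is a well-formed UTF-8 byte sequence."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def encodable_in_ascii(text: str) -> bool:
    """Return True if every character of text exists in ASCII."""
    try:
        text.encode("ascii")
        return True
    except UnicodeEncodeError:
        return False


# A lone E9 is fine as ISO-8859-1 but is not valid UTF-8 on its own.
print(decodes_as_utf8(latin1_bytes))  # False
print(decodes_as_utf8(utf8_bytes))    # True

# And the character has no ASCII representation at all.
print(encodable_in_ascii(E_ACUTE))    # False
```

So a hex dump showing E9 for that character points at a single-byte 
encoding such as ISO-8859-1, whatever the XML declaration claims.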

And note I'm using the label ISO-8859 without the trailing -digit.  Not 
to be inaccurate but to avoid opening up a whole other huge can of 
worms.  The accented e character can be represented as E9 in most, but 
not all, of the ISO-8859 variants.  I don't know which one he's using, 
and frankly don't think it would be useful to assume.  But it's probably 
not important, certainly not as important as quashing this whole ASCII 
nonsense.
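
For anyone who wants to check the "most, but not all" claim rather than 
take it on faith, here is a rough Python 3 sketch.  It assumes the codec 
names iso8859_1 through iso8859_16 as shipped with Python's standard 
library (there is no part 12), and decode_e9 is just a helper name made 
up for this example.

```python
# The byte under discussion.
TARGET = b"\xe9"


def decode_e9(part: int):
    """Decode byte E9 under ISO-8859-<part>.

    Returns the resulting character, or None if the codec is missing
    or the byte is unassigned in that variant.
    """
    try:
        return TARGET.decode(f"iso8859_{part}")
    except (UnicodeDecodeError, LookupError):
        return None


# ISO-8859 has parts 1 through 16; part 12 was never published.
results = {part: decode_e9(part) for part in range(1, 17) if part != 12}

for part, ch in results.items():
    if ch is None:
        label = "unassigned or codec unavailable"
    else:
        # Print the code point rather than the glyph, so the output does
        # not itself depend on the console's encoding.
        label = f"U+{ord(ch):04X}"
    marker = "e-acute" if ch == "\u00e9" else "something else"
    print(f"ISO-8859-{part}: E9 -> {label} ({marker})")
```

The Latin-script parts come out as U+00E9, while parts such as the 
Cyrillic and Greek ones map E9 to a different character entirely.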

When dealing with encodings the devil is in the details and there are a 
LOT of subtle nuances that make HUGE differences.  The last thing anyone 
SERIOUS about character handling needs to be basing their thinking on is 
ASCII.  Just stop.

-Bill Kearney
Syndic8.com


