[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: International Characters in attributes

Subject: Re: International Characters in attributes
From: David Carlisle <davidc@xxxxxxxxx>
Date: Mon, 12 Feb 2001 10:47:35 GMT
save international character in xml
> That's partly why people still use encodings other than utf-8. And
> once you do, the same numeric character references will mean different
> things in different encodings,

No, in XML and in HTML (4+) a numeric character reference always refers
to  the unicode position. It does not refer to th eposition in the
current encoding.

This is why I say that if your markup only uses ascii characters (as in
XHTML) the you can encode your file in any encoding (eg us-ascii)
without loss of information, as all characters are accessible via
numeric references.

The rules for XML are rather different than the rules for HTML (and the
rules for HTML changed over time as it migrated from latin1 to unicode
as its character repertoire). But for an XML system at least, it is
clear that any system that claims to be xml conforming has to accept
UTF8.

(It isn't only Asian languages that you mention that use "long" utf8
byte sequences, Unicode 3.1 promises to add around a thousand
mathematical aphanumeric symbols into plane 1, and these will be used by
MathML systems)

David

 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.