[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: XSL and international characters

Subject: Re: [xsl] XSL and international characters
From: Tony Graham <tony.graham@xxxxxxx>
Date: Tue, 4 Dec 2001 13:55:51 +0000
xsl junk characters
Marcin =?iso-8859-2?Q?K=B3os?= wrote at  4 Dec 2001 14:28:50 +0100:
 > Orginal character was %C5%82 and the result was &Aring; - one character and 
 > &#130; - second character :(

Your one-byte character is being represented using two bytes in UTF-8.

UTF-8 is a variable-length encoding, and characters may be represented
with up to four bytes (depending on the code point).

It's a fluke of the UTF-8 design that, for many Latin-1 characters,
looking at the UTF-8 representation in a Latin-1 system appears to
show the desired character plus random junk.

If you looked at the output with a UTF-8-aware viewer, you'd only see
the one character.

Regards,


Tony Graham
------------------------------------------------------------------------
XML Technology Center - Dublin                mailto:tony.graham@xxxxxxx
Sun Microsystems Ireland Ltd                       Phone: +353 1 8199708
Hamilton House, East Point Business Park, Dublin 3            x(70)19708

 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.