2008/8/11 Colin Paul Adams <colin@xxxxxxxxxxxxxxxxxx>:
>>>>>> "Raja" == Sathasivam, Elayaraja <elayaraja.sathasivam@xxxxxxxxxxxxx>
writes:
>
> Raja> My ascii file output shows as b' Is it right ?
>
> No. That is not an ASCII character.
That's an a with circumflex, which usually indicates you've parsed a
multi-byte UTF-8 character using a single byte encoding.
For a good explanation read this: http://www1.tip.nl/~t876506/utf8tbl.html
Just make sure that whenever you go from characters to bytes and vice
versa that you always use the same encoding - in Java it's simplest
just to set the platform default encoding to UTF-8 eg:
-Dfile.encoding=UTF-8
If you displaying the result in a browser, make sure the appropriate
meta is set.
If you are just viewing the result in some text editor (say eclipse's
output pane) make sure that's also set to use UTF-8
Fun isn't it?
--
Andrew Welch
http://andrewjwelch.com
Kernow: http://kernowforsaxon.sf.net/
|