[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] encoding woes: ISO-8859-1 vs. UTF-8
Hi, I am confused with the recent behavior described following regarding encoding. I have a string "ôLogö" in CSV, with those two weird characters actually being “ and ”, characters in General Punctuation II. Here is the steps I am going through, consistently using ISO-8859-1 for encoding: A. Import CSV 1. convert CSV to generic XML: the string did not change, stayed "ôLogö". 2. saxon convert generic XML to proprietary XML: string got converted to "“Log”"; 3. import successful B. Export into CSV 1. pull from MSSQL7 to proprietary XML: "ôLogö" 2. saxon convert proprietary XML to CSV: exception org.xml.sax.SAXException: Output character not available in this encoding (decimal 8220) Why going one way it works and not the other? When I tried to consistently use UTF-8 as encoding, import CSV did not work with exception about character not supported. Any idea how I can resolve this problem would be very much appreciated. Thanks, Xiaocun __________________________________________________ Do You Yahoo!? Yahoo! Health - Feel better, live better http://health.yahoo.com XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|