Re: Urgent help in XML parser
Your data is not UTF-8. It is probably in the Windows Latin 1 code page, a.k.a. "ANSI", a.k.a. CP-1252. The SAX parser is correct to complain.

Change the encoding declaration to "WINDOWS-1252", which is the preferred name on the Internet.

Cheers
Rick Jelliffe

----- Original Message -----
From: "Malligeswari N" <malliga@d...>
To: <xml-dev@l...>
Sent: Tuesday, June 03, 2003 5:02 PM
Subject: Urgent help in XML parser

Hi All,

I'm using a SAX parser. My XML document declares its encoding as 'UTF-8'. My input data looks like this:

<DATA_DESCRIPTION><![CDATA[ TODAY'S DATE ]]></DATA_DESCRIPTION>

My parser throws an error while parsing this particular character, " ' " (apos):

java.io.UTFDataFormatException: invalid byte 1 of 1-byte UTF-8 sequence (0x92)
    void org.apache.xerces.parsers.StandardParserConfiguration.parse(org.apache.xerces.xni.parser.XMLInputSource)
    void org.apache.xerces.parsers.XMLParser.parse(org.apache.xerces.xni.parser.XMLInputSource)
    void org.apache.xerces.parsers.AbstractSAXParser.parse(org.xml.sax.InputSource)
    ...

Please let me know how to solve this.

Thanks and Regards,
Malligen.
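The diagnosis above can be reproduced with a small sketch. In Windows-1252, byte 0x92 is the right single quotation mark (U+2019), which is a typical result of pasting "smart quotes" from a Windows application. A lone 0x92 is never a valid UTF-8 sequence. The class name EncodingDemo and the helpers sampleDocument and parseText are hypothetical names made up for this illustration, and the code assumes the JAXP SAX parser bundled with the JDK rather than the standalone Xerces build shown in the stack trace, so the exact exception class may differ from the one in the original report.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

class EncodingDemo {

    // Builds a small document whose CDATA section contains the raw byte 0x92,
    // with the given name written into the encoding declaration.
    static byte[] sampleDocument(String declaredEncoding) {
        byte[] head = ("<?xml version=\"1.0\" encoding=\"" + declaredEncoding + "\"?>"
                + "<DATA_DESCRIPTION><![CDATA[ TODAY").getBytes(StandardCharsets.US_ASCII);
        byte[] tail = "S DATE ]]></DATA_DESCRIPTION>".getBytes(StandardCharsets.US_ASCII);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        out.write(head, 0, head.length);
        out.write(0x92); // the offending byte from the error message
        out.write(tail, 0, tail.length);
        return out.toByteArray();
    }

    // Parses the bytes with a SAX parser and returns the collected character data.
    // The parser picks the decoder from the encoding declaration in the document.
    static String parseText(byte[] xml) throws Exception {
        StringBuilder text = new StringBuilder();
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) {
                text.append(ch, start, length);
            }
        };
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new ByteArrayInputStream(xml)), handler);
        return text.toString();
    }

    public static void main(String[] args) throws Exception {
        // Declared as windows-1252: 0x92 decodes to U+2019.
        String text = parseText(sampleDocument("windows-1252"));
        System.out.println("contains U+2019: " + (text.indexOf('\u2019') >= 0));

        // Declared as UTF-8: the same bytes are rejected by the parser.
        try {
            parseText(sampleDocument("UTF-8"));
            System.out.println("parsed without error");
        } catch (Exception e) {
            System.out.println("rejected: " + e.getClass().getName());
        }
    }
}
```

If the declaration cannot be edited, another option is to tell the parser the encoding from outside with InputSource.setEncoding("windows-1252") on a byte-stream source; whether that overrides a conflicting declaration inside the document depends on the parser, so correcting the declaration (or re-saving the file as real UTF-8) is the safer fix.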