Re: some character entities are now '?'
At 5:41 PM -0400 7/23/03, David M Williams wrote:

> Sun bug#4646959
> http://developer.java.sun.com/developer/bugParade/bugs/4646959.html
>
> That bug had some helpful info in it

Youch. What a nasty bug. And contrary to Sun's claims, this is a bug in Java 1.4. If I'm reading this right, any XML parser using InputStreamReader to translate UTF-8 into Java strings and chars is likely to miss malformedness errors that arise from bad UTF-8. This seems to affect other character sets as well.

It looks like you could work around this in 1.4 to have InputStreamReader report the bad data. However, that code would not be portable back to Java 1.3 (which does not have this bug). Thus you'd need separate code bases for 1.3 and 1.4, or some really ugly reflection-based code.

We really need to get this fixed.

--
Elliotte Rusty Harold
elharo@m...
Processing XML with Java (Addison-Wesley, 2002)
http://www.cafeconleche.org/books/xmljava
http://www.amazon.com/exec/obidos/ISBN%3D0201771861/cafeaulaitA
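
For illustration, here is a minimal sketch of what such a 1.4-only workaround might look like. It is an assumption about the approach, not code from the bug report: it hands InputStreamReader a CharsetDecoder configured with CodingErrorAction.REPORT, so that malformed bytes raise a CharacterCodingException instead of being silently replaced. The class name StrictUtf8 and both method names are hypothetical. Because java.nio.charset first appeared in Java 1.4, this would not compile on 1.3, which is the portability problem described above.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;

// Hypothetical helper; the names are illustrative only.
class StrictUtf8 {

    // Wraps a byte stream in a Reader whose decoder reports bad input
    // rather than substituting a replacement character.
    static Reader strictReader(InputStream in) {
        CharsetDecoder decoder = Charset.forName("UTF-8").newDecoder()
            .onMalformedInput(CodingErrorAction.REPORT)
            .onUnmappableCharacter(CodingErrorAction.REPORT);
        return new InputStreamReader(in, decoder);
    }

    // Reads the whole array through the strict reader and returns false
    // if the decoder reports a coding error along the way.
    static boolean isWellFormedUtf8(byte[] data) {
        Reader reader = strictReader(new ByteArrayInputStream(data));
        try {
            while (reader.read() != -1) {
                // Discard the characters; only decoding errors matter here.
            }
            return true;
        } catch (CharacterCodingException e) {
            return false;
        } catch (IOException e) {
            // Not expected for an in-memory stream.
            throw new RuntimeException(e);
        }
    }
}
```

A parser would presumably turn the CharacterCodingException into a well-formedness error rather than a boolean; the boolean form is used here only to keep the sketch short.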