Re: SAXON and UTF-8
Thanks Mike,

> This message suggests that there's no problem with your UTF-8, but
> there is a problem with your XML. Without seeing the file, I can't
> tell you what the problem is.

The original document was too large to quote, but here's an example that triggers the same error in SAXON:

<?xml version="1.0"?>
<a>ä</a>

The file is called "a.xml" and I intend to store it in UTF-8, so there's no need to declare the use of Latin 1. Here's the stylesheet:

<?xml version="1.0"?>
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform" version="1.0">
  <xsl:template match="a">
    <b><xsl:value-of select="."/></b>
  </xsl:template>
</xsl:stylesheet>

This is the result when I save a.xml in "Unicode" format:

<?xml version="1.0" encoding="utf-8"?><b>Ã¤</b>

As expected, but why is the encoding declaration added? When saving a.xml in "UTF-8" I get the same error as before:

E:\test>saxon a.xml a.xsl > b.xml
Fatal error reported by XML parser: required character (found "?") (expected "<" )
URL: file:/E:/test/a.xml
Line: 1
Column: 5
Error required character (found "?") (expected "<")
Transformation failed

EditPlus seems to inherit its UTF-8 and Unicode functionality from the Windows API, so I get the same error when saving as UTF-8 in Notepad in Windows 2000.

Regards,
Gustaf Liljegren

XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
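
For anyone trying to reproduce this, a small sketch (not part of the original report) may help show what bytes an editor actually wrote at the start of a.xml. It assumes, without confirmation from the error output above, that the "UTF-8" save option prepends a byte order mark and that "Unicode" means UTF-16 little-endian with a byte order mark. The function names are hypothetical. It also illustrates why ä can show up as Ã¤ when UTF-8 output is viewed as Latin 1.

```python
import codecs

# The two-line test document from the post (ä written as an escape).
XML_TEXT = '<?xml version="1.0"?>\n<a>\u00e4</a>\n'


def encode_variants(text):
    """Return the document encoded three ways.

    Assumption: the editor's "UTF-8" option may add a BOM, and its
    "Unicode" option is UTF-16 LE with a BOM. Neither is confirmed
    by the post itself.
    """
    return {
        "utf-8": text.encode("utf-8"),
        "utf-8 with BOM": codecs.BOM_UTF8 + text.encode("utf-8"),
        "utf-16 LE with BOM": codecs.BOM_UTF16_LE + text.encode("utf-16-le"),
    }


def leading_bytes(data, count=4):
    """Format the first few bytes as hex, e.g. 'EF BB BF 3C'."""
    return " ".join(f"{b:02X}" for b in data[:count])


def describe_bom(data):
    """Report which byte order mark, if any, starts the data."""
    if data.startswith(codecs.BOM_UTF8):
        return "UTF-8 BOM"
    if data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return "UTF-16 BOM"
    return "no BOM"


def as_latin1(text):
    """Show how UTF-8 bytes look when misread as Latin 1."""
    return text.encode("utf-8").decode("latin-1")


if __name__ == "__main__":
    for label, data in encode_variants(XML_TEXT).items():
        print(f"{label:20} {leading_bytes(data)}  {describe_bom(data)}")
    print(as_latin1("\u00e4"))
```

To check a real file instead, read it in binary mode (open("a.xml", "rb").read()) and pass the result to leading_bytes and describe_bom. A file that begins with anything other than 3C ("<") or a byte order mark the parser recognises would be one candidate explanation for the "expected <" message, but that is a guess until the bytes are inspected.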