|
[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: Problem with encoding UTF-8
David at roamware writes: > If I take the file he sent me and in UltraEdit 32 use the UNICODE/UTF-8 -> > UTF-8 conversion option save the file and then pop it through my program, > all works fine. This ambiguous conversion is explained thus "This function > will convert the complete file from Unicode or UTF-8 (ASCII representation) > to UTF-8 (with the file internally as Unicode)" > > So I am at a bit of a loss to explain what the file format has to do with > this, the PDF exports the file with the "encoding=UTF-8" in the xml element. > Any experience of this behaviour and how to get around it? I cannot change > what the PDF exports so it will have to be a "not strict" switch or > something on the parser I suppose (couldn't find reference to such a thing > mind you.). Can you examine the differences between the two files? (I would use GNU Emacs and its M-x ediff-files command.) What does GNU recode tell you about the original file when you "convert" it with utf-8..dump (with and without the --strict option)? -- Kevin Rodgers
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|

Cart








