[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Problem with encoding UTF-8

Subject: Re: Problem with encoding UTF-8
From: Kevin Rodgers <kevin.rodgers@xxxxxxx>
Date: Wed, 15 Dec 2004 10:24:56 -0700
kevin rodgers
David at roamware writes:
> If I take the file he sent me and in UltraEdit 32 use the UNICODE/UTF-8 ->
> UTF-8 conversion option save the file and then pop it through my program,
> all works fine. This ambiguous conversion is explained thus "This function
> will convert the complete file from Unicode or UTF-8 (ASCII representation)
> to UTF-8 (with the file internally as Unicode)"
> 
> So I am at a bit of a loss to explain what the file format has to do with
> this, the PDF exports the file with the "encoding=UTF-8" in the xml element.
> Any experience of this behaviour and how to get around it? I cannot change
> what the PDF exports so it will have to be a "not strict" switch or
> something on the parser I suppose (couldn't find reference to such a thing
> mind you.).

Can you examine the differences between the two files?  (I would use GNU
Emacs and its M-x ediff-files command.)

What does GNU recode tell you about the original file when you "convert"
it with utf-8..dump (with and without the --strict option)?

-- 
Kevin Rodgers

Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.