RE: Integrating a Search and Replace template with the
> The characters that are affecting things are part of the
> UNICODE set 'General Punctuation'. This is translating
> through the stylesheet fine and is being displayed in the
> resulting XML by ’ (right hand quote) and – (en
> dash). Problem is, my dynamic website does not know how to
> display these characters, and I am getting the little boxes instead.

It's not surprising that it doesn't know how to display them, since neither of these codepoints is assigned to any printable Unicode character. The Unicode codepoint for en dash is x2013; the code for "right single quotation mark" is x2019.

What has happened is that your input uses the Microsoft-proprietary cp1252 character encoding. There's no harm in that, provided that the software reading the file knows it's in this encoding, so that it can translate such characters to their proper Unicode values for use in the output XML.

> I am thinking of integrating a Global Search and Replace
> template that runs on the final XML to find all instances of
> ’ and replace with ' .

No, you should fix the problem at source rather than patching it up later. If you're reading the CSV file using unparsed-text(), and if the CSV file is in cp1252 encoding, then you can specify this in the encoding parameter to unparsed-text().

Michael Kay
http://www.saxonica.com/
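As a rough illustration of the encoding point above (not part of the original exchange), the following Python sketch shows what happens to the cp1252 bytes 0x92 and 0x96 depending on how they are decoded. The sample byte string is invented for the demonstration, and `describe` is a hypothetical helper. Read as raw codepoints they land on unassigned control positions; decoded as cp1252 they become x2019 and x2013, which is the translation that naming the encoding when reading the file is meant to achieve.

```python
import unicodedata

# Invented sample: bytes as they might appear in a cp1252-encoded CSV field.
raw = b"Smith\x92s report, pages 10\x9612"

# Treating each byte as a codepoint (ISO-8859-1 does exactly this) leaves
# 0x92 and 0x96 as C1 control characters, which browsers cannot display.
wrong = raw.decode("latin-1")

# Decoding as cp1252 maps the same bytes to the real punctuation characters.
right = raw.decode("cp1252")


def describe(text):
    """Return (codepoint, Unicode general category) for each non-ASCII character."""
    return [
        (f"U+{ord(ch):04X}", unicodedata.category(ch))
        for ch in text
        if ord(ch) > 127
    ]


print(describe(wrong))  # control characters (category Cc)
print(describe(right))  # right single quotation mark and en dash
```

Category `Cc` marks a control character, which is why the website shows little boxes; once the bytes are decoded with the right encoding, no search-and-replace pass over the output is needed.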