[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Special characters in XSL processing pipeline

Subject: Special characters in XSL processing pipeline
From: Chris McGrath <cmcgrath@xxxxxxxxxxxxxxxxxxxxxxx>
Date: Thu, 21 Sep 2000 14:36:54 +0100
pipeline character
Hi there, I have a processing pipeline set up which scrapes a XHTML file
into XML. This is currently in two stages, the first extracts the
information, and the second re-organises it to fit the DTD. I am having a
problem with German characters in certain files, using MSXML3 I get a 'An
invalid character was found in text content' error for the ü in Düsseldorf.
I was under the impression I would be OK with these as they're in UTF-8. I
have tried using the HTML tidy utility from the w3c to turn these into
character references so I have &#252; instead of ü, and run this on the
output of each step in the pipeline before passing it on. This unfortunately
brings up another problem, due to  whitespace in the tidied output.

Am I doing something wrong or should it work without having to change the
character references?

Chris



 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.