Subject:Invalid Characters Author:Alicia Roberts Date:21 Mar 2007 05:51 PM
Hello,
I'm trying to process data which is in spanish and I'm getting "an invalid character was found in text content". Example character such as, ó and Ñ. How do I avoid getting this error? Is there a why to process these characters.
Subject:Re: Invalid Characters Author:(Deleted User) Date:22 Mar 2007 04:06 AM
Hi Alicia,
if Stylus complains about "invalid characters" it means that your XML
file doesn't have the proper encoding declaration, and so it is
supposed to be encoded in UTF-8. Have you tried adding a <?xml
version="1.0" encoding="ISO-8859-1"?> declaration at the beginning of
the document?
Subject:Re: Invalid Characters Author:(Deleted User) Date:22 Mar 2007 10:28 AM
Hi Alicia,
that's exactly the point: the encoding of your document is
ISO-8859-1, and it will be parsed correctly only if that precise
encoding is specified.
Subject:Re: Invalid Characters Author:Goetz Heller Date:17 May 2007 06:44 PM
May be generally you'd be better off converting files to UTF-8 since this encoding supports full unicode so you don't have to worry about foreign characters (next time you are likely to find a character which is not covered by iso8859.1. Such conversion can be easily achieved by using an appropriate editor, e.g. notepad++ which is open source. Be aware that an UTF-8 file is usually prepended with a byte order mark (BOM). When opening such a file in an ascii editor, the BOM shows up as as sequence of 3 characters and you might wonder where they come from.