Subject:Convert XML to UTF-8 Author:(Deleted User) Date:05 Jan 2009 08:42 AM
Hi Scott,
the declaration specifies the UTF-8 encoding; and if Stylus loads it without complaining, it means it's valid UTF-8. How are you generating it? Could you zip it and post it in the forum so that we can double check it?
Subject:Convert XML to UTF-8 Author:(Deleted User) Date:05 Jan 2009 12:15 PM
Hi Scott,
your XML files contains a few non-latin characters (like the accented "e", the "degree" and the "registered name" symbols) that are not in the range 0x00-0x7F. So, in UTF-8 they are encoded using multiple values; in your case, for instance, the "degree" symbols, whose Unicode codepoint is 0xB0, is stored as 0xC2 0xB0. This is correct UTF-8; so, something is wrong in your client configuration. Do you know which software he is using, and can you get the offset where the wrong codepoint is reported?
Subject:Convert XML to UTF-8 Author:(Deleted User) Date:06 Jan 2009 09:47 AM
Hi Scott,
I tested the validator with a subset of the file (20Mb is too big for their web interface) and it pass the well-formedness test. The validation step clearly fails because it's missing a schema. Could it be that the error your client reports is related to a schema validation error, and not to a wrong UTF-8 codepoint?