[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: Characters having an ASCII value > 127
* Steffen Rodig | | If I try to use expat on the generated XML file, it tells me that it | is not wellformed at the position where such a character occurs. | Does the XML spec say anything about not permitting characters with | high ASCII values? If so, where? It doesn't. However, the XML spec _does_ say that unless XML entities have an XML declaration with an encoding declaration parsers are to assume that the entity is UTF-8-encoded. This means that if you have used ISO 8859 you may get problems, since these characters will either be mapped to a (seemingly) random Unicode code point or simply be invalid bit sequences that do not resolve to any character at all. | I guess, to correctly interpret and display those characters I have | to know the character set which was used to encode the original text | file. Bingo. | How can I communicate this character set to an XML parser? You do this on the XML declaration, like so: <?xml version="1.0" encoding="iso-8859-1"?> | I would be happy if anybody could point me to somewhere I could | start reading about this issue. Rick Jelliffe devotes a large part of The SGML/XML Cookbook to character sets and how they are used in XML and SGML. Other than that I don't know of any good resources apart from good old-fashioned digging in various places. --Lars M. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i... Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ To (un)subscribe, mailto:majordomo@i... the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@i... the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@i...)
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|