[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] RE: detecting character set of an XML doc
> -----Original Message----- > From: Dirk Germonpre [SMTP:dirkg@t...] > > Hello, > > If I'm writing an XML tool, how can I detect what character set is used > for > an XML document? I've read that for UTF-16, an encoding signature (xFEFF) > is used at the beginning of the document. Is there a different encoding > signature for each character set? > Not each "character set" - but each encoding, yes. They all exhibit different binary signatures, otherwise they would be the same... :) > If so, where can I find documentation on this? You can start with the XML spec, appendix B (I think it's B) contains some brief information. Or John Cowan posted a C decoder which detected the most common ones, or you can find my perl Apache module on CPAN, which detects the character sets and encoding. Matt. xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i... Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1 To (un)subscribe, mailto:majordomo@i... the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@i... the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@i...)
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|