
Re: Fw: Encodings and how they're specified

  • From: John Cowan <cowan@mercury.ccil.org>
  • To: Hermann Stamm-Wilbrandt <STAMMW@de.ibm.com>
  • Date: Tue, 5 Jul 2011 13:14:16 -0400

Hermann Stamm-Wilbrandt scripsit:

> So an XML processor/parser should be able to deal with ebcdic.xml and
> correctly determine its "ebcdic-de" encoding, right?

"Should" is too strong.  Many, if not most, XML parsers will not
understand this encoding, though in that case they should successfully
reject it.  Appendix F explains how to identify a generic EBCDIC XML
document by looking for the "4C 6F A7 94" bytes ("<?xm" in EBCDIC) with
which it must begin, though it is still necessary to read through the
encoding declaration in order to determine the exact flavor of EBCDIC
in use.
The invariant character set (00640) can be used to decode the specified
encoding name, unless the encoding is code page 290, which does not have
lower-case Latin letters anyway.

http://recycledknowledge.blogspot.com/2005/07/hello-i-am-xml-encoding-sniffer.html
gives a detailed algorithm.
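
As a rough illustration of the check described above (not the algorithm
from the linked post), here is a minimal Python sketch.  The function name
sniff_ebcdic_encoding is hypothetical, and using Python's cp037 codec as a
stand-in for the invariant character set is an assumption: it works only
because the characters allowed in an encoding declaration sit at the same
code points in the common EBCDIC code pages.

```python
import re

# "<?xm" as encoded in EBCDIC, per Appendix F of the XML spec.
EBCDIC_MAGIC = bytes([0x4C, 0x6F, 0xA7, 0x94])

# "?>" in EBCDIC; marks the end of the XML declaration.
EBCDIC_DECL_END = bytes([0x6F, 0x6E])


def sniff_ebcdic_encoding(data: bytes):
    """Return the declared encoding name if data looks like an EBCDIC
    XML document, otherwise None.  (Hypothetical helper.)"""
    if not data.startswith(EBCDIC_MAGIC):
        return None  # not EBCDIC, or no XML declaration
    end = data.find(EBCDIC_DECL_END)
    if end == -1:
        return None  # declaration never closes
    # Assumption: cp037 decodes the invariant characters (letters,
    # digits, quotes, '=', '.', '_', '-') the same way as other
    # EBCDIC flavors, so it is safe for the declaration only.
    decl = data[:end + 2].decode("cp037", errors="replace")
    match = re.search(
        r'encoding\s*=\s*(["\'])([A-Za-z][A-Za-z0-9._-]*)\1', decl
    )
    return match.group(2) if match else None
```

The returned name (for example "ebcdic-de") still has to be mapped to a
codec the parser actually supports; if there is none, the document should
be rejected rather than guessed at.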

-- 
Note that nobody these days would clamor for fundamental laws        John Cowan
of *the theory of kangaroos*, showing why pseudo-kangaroos are   cowan@ccil.org
physically, logically, metaphysically impossible.    http://www.ccil.org/~cowan
Kangaroos are wonderful, but not *that* wonderful.     --Dan Dennett on zombies

