[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Characters having an ASCII value > 127

  • From: Toby Speight <tms@a...>
  • To: "XML developers' list" <xml-dev@i...>
  • Date: 18 Sep 1998 14:06:28 +0100

ascii value list
Steffen> Steffen Rodig <URL:mailto:rodig@s...>

0> In article <199809181228.OAA16525@s...>, Steffen wrote:

Steffen> imagine a plain text file which I want to markup using
Steffen> XML. Now it could be that there are characters in this file
Steffen> whose ASCII value is greater than 127 (in PCDATA sections).

No character has an ASCII value greater than 127: ASCII is a 7-bit
encoding.  Of course, it's possible to use characters beyond ASCII,
since the Document Character Set for XML is Unicode.


Steffen> If I try to use expat on the generated XML file, it tells
Steffen> me that it is not wellformed at the position where such a
Steffen> character occurs.

Perhaps your XML declaration doesn't agree with the actual encoding
of the document (you don't say what either of these are for your
document).  See Sections 2.8 and 4.3.3, and Appendix F.


Steffen> I guess, to correctly interpret and display those characters
Steffen> I have to know the character set which was used to encode the
Steffen> original text file.

Of course - the parser is unlikely to be able to tell the difference
between the various parts of ISO 8859, for instance.


Steffen> How can I communicate this character set to an XML parser?

In the encoding declaration, <?xml encoding="utf-8"?> (or whatever).

You may prefer to write the problematic characters as entities or
character references, if they are rare in your source.  This may
allow you to write your documents in a smaller character set.  (As an
example, I find it easiest to author in ISO-8859-1, but I need to
define entities for the Welsh characters, which lie in the Latin-2
plane.)

-- 


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i...
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@i... the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@i... the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@i...)


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.