
Re: multiple encoding specs (Re: IE5.0 does not conform to RFC2376)

  • From: Chris Lilley <chris@w...>
  • To: John Cowan <cowan@l...>
  • Date: Sun, 11 Apr 1999 03:15:44 +0200


John Cowan wrote:
> 
> Rick Jelliffe wrote:
> > However, it is all spoiled if there are systems which corrupt the
> > labels: for example by rewriting the charset parameter incorrectly. It
> > is far better to send the XML file without a charset parameter than to
> > send it with a wrong one.

Yes. But even better to send it with a correct one. This is easily done;
just ensure that the server always sends the same charset that the XML
encoding declaration specifies.
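
A minimal sketch of that rule in Python, assuming the server has the document bytes in hand and that the document is in an ASCII-compatible encoding, so the declaration can be read with a byte-level regular expression. The names `declared_encoding` and `content_type_for` are hypothetical, and the UTF-8 fallback for a document with no declaration is an assumption that ignores byte order marks and UTF-16.

```python
import re

# Matches an XML declaration at the very start of the document and captures
# the value of its encoding pseudo-attribute (assumes ASCII-compatible bytes).
_ENCODING_RE = re.compile(
    rb"""^<\?xml\s+version\s*=\s*["'][^"']*["']\s+encoding\s*=\s*["']([A-Za-z][A-Za-z0-9._-]*)["']"""
)


def declared_encoding(document: bytes):
    """Return the encoding named in the XML declaration, or None if absent."""
    match = _ENCODING_RE.match(document)
    if match is None:
        return None
    return match.group(1).decode("ascii")


def content_type_for(document: bytes) -> str:
    """Build a Content-Type value whose charset repeats the XML declaration."""
    # Assumption: with no declaration, treat the document as UTF-8.
    encoding = declared_encoding(document) or "UTF-8"
    return f"text/xml; charset={encoding}"
```

With this approach the charset parameter is derived from the document itself rather than from server configuration, so the two labels cannot drift apart.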

> But there's the snag: in text/xml documents, a missing charset parameter
> does not mean "Charset unspecified"; it means "Charset specified
> as US-ASCII".  

This is correct; the RFC does say that. Note that this thread is
primarily about whether the RFC *should* say that or *should* say
something different, something which does not needlessly contradict the
XML 1.0 Recommendation.
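
To make the current RFC behaviour concrete, here is a rough Python sketch of how a receiver might pick the charset as this thread describes it: an explicit charset parameter wins, a text/xml document with no parameter is treated as US-ASCII, and an application/xml document with no parameter falls back to whatever the document declares. The function name `effective_charset` and its `declared_in_xml` argument are hypothetical, and the final UTF-8 default is an assumption.

```python
from email.message import Message


def effective_charset(content_type_header: str, declared_in_xml=None) -> str:
    """Return the charset a receiver would use for an XML entity body."""
    # Reuse the standard library's MIME header parsing.
    msg = Message()
    msg["Content-Type"] = content_type_header

    charset = msg.get_param("charset")
    if charset is not None:
        # An explicit charset parameter is authoritative.
        return str(charset).lower()

    if msg.get_content_type() == "text/xml":
        # Missing parameter on text/xml means "specified as US-ASCII".
        return "us-ascii"

    # application/xml: nothing is implied, so follow the document's own
    # declaration (assumption: default to UTF-8 when there is none).
    return (declared_in_xml or "utf-8").lower()
```

Note how, in the text/xml case, the encoding declaration inside the document is never consulted, which is the contradiction being argued about here.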

> There is no way to fail to specify a charset in
> text/* documents, and rightly so, because text without a charset
> is uninterpretable.

This is disingenuous; both clauses are true, but the second one implies
that there is no other method of conveying the information, which,
clearly, there is.

So
a) There is no way to fail to specify a charset in text/* documents

But it does not have to be explicit. It can be implied. A good way of
formalising that implication would be to refer to the rules in the XML
1.0 Recommendation.

b) text without a charset is uninterpretable.

Also true, but that labelling is already defined in XML and handily
travels with the document instance so that it is not lost as soon as
the document is saved to disk.


> In SGML terms, omitting the charset in text/* documents is a mere
> minimization, whereas in application/* documents it is a true #IMPLIED.

Actually, if you read the XML Recommendation, then unless the charset is
UTF-8 or UTF-16, the charset (encoding declaration) is #REQUIRED.
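
As a small illustration of that rule, assuming no external label such as a MIME header is available, the check reduces to a test on the encoding name. The function name `needs_encoding_declaration` is hypothetical; Python's `codecs.lookup` is used only to normalise aliases such as "UTF8".

```python
import codecs


def needs_encoding_declaration(actual_encoding: str) -> bool:
    """True if an entity stored in this encoding must declare it."""
    # codecs.lookup maps aliases to a canonical name, e.g. "UTF8" -> "utf-8".
    # It raises LookupError for names Python does not know.
    canonical = codecs.lookup(actual_encoding).name
    return canonical not in {"utf-8", "utf-16"}
```

So a Latin-1 document has to carry its own label, while a UTF-8 or UTF-16 one may omit it.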

--
Chris



xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i...
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@i... the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@i... the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@i...)

