[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: MSXML DOM Special Chars Less Than 32

msxml mime
From: "John Cowan" <jcowan@r...>
> Rick Jelliffe scripsit:
> >    * The RFC on MIME types talks about "textual" data rather than the
> >      text/binary distinction.
> >       So control characters in an ASCII file may be "text" to some
> >       but they are not "textual". 
> The 39 Articles of the Church of England ban "the Romish doctrine of
> Purgatory" and "the sacrifice of the masses", but John Henry Newman
> (back when he was an Anglican, rather than a Roman, Catholic) showed
> that Romish was not the same as Roman, and the masses not the same
> as the Mass, so Anglicans could believe in the Roman doctrine of
> Purgatory, and celebrate the sacrifice of the Mass, without offending
> the Articles in the least.
Yes, as the beautiful "Apologia pro vita sua" shows, Newman really tried 
hard to stay. 

But in the case of textual, the definition comes from an RFC rather
than a musty post-counter-reformation don.  

RFC 2048[1] 
Multipurpose Internet Mail Extensions
(MIME) Part Two:
Media Types

"3. Overview Of The Initial Top-Level Media Types 
The five discrete top-level media types are: 

text -- textual information. The subtype "plain" in particular indicates plain text containing no formatting commands or directives of any sort. Plain text is intended to be displayed "as-is". No special software is required to get the full meaning of the text, aside from support for the indicated character set. Other subtypes are to be used for enriched text in forms where application software may enhance the appearance of the text, but such software must not be required in order to get the general idea of the content. Possible subtypes of "text" thus include any word processor format that can be read without resorting to software that understands the format. In particular, formats that employ embeddded binary formatting information are not considered directly readable. A very simple and portable subtype, "richtext", was defined in RFC 1341, with a further revision in RFC 1896 under the name "enriched". "

In section 4.1.2, about text/*  is interesting too: 

"Aside from these conventions, any use of the control characters or DEL in a body must either occur 
  1.. because a subtype of text other than "plain" specifically assigns some additional meaning, or 

  2.. within the context of a private agreement between the sender and recipient. Such private agreements are discouraged and should be replaced by the other capabilities of this document. "

Should XML follow or lead?  XML 1.0 was designed with painstaking attention
to fitting in with existing standards and infrastructure, and it has done because of it.

Rick Jelliffe

[1] http://www.nacs.uci.edu/indiv/ehood/MIME/2046/rfc2046.html


Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
First Name
Last Name
Subscribe in XML format
RSS 2.0
Atom 0.3

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.

Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.