[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: xml over http - RFC 3023

  • From: "Andrew Welch" <andrew.j.welch@g...>
  • To: "Chris Burdess" <dog@b...>
  • Date: Fri, 28 Nov 2008 11:01:22 +0000

Re:  xml over http - RFC 3023
>> I think many parsers can read from a web resource, but few use the
>> encoding information from the content type.
>
> The thing is that XML documents are designed to be read where there is no
> external content-type information (such as from a filesystem) as well as
> where there is.

At the moment I'm:

- using the encoding in the ContentType
- if that's not present, using the encoding in the prolog (having read
those first few bytes in us-ascii)
- if that's not present deafulting to UTF-8

That seems to cover most bases.

Isn't there a reliance though, that the actual encoding and the
encoding used to serve the file match?  If the xml is windows-1252
(with the encoding correctly specified in the prolog) but served as
utf-8 then and characteres in the C0/C1 ranges will cause a parse
error won't they?

The only way would be to either read the prolog first and then serve
the file using the encoding specfied in that, or parse the XML then
serialize back to bytes using whatever encoding is used in the
contenttype (when serving a static file from disk).  It's almost as if
the server needs to treat XML as a special case.  I'm still learning
all this and thinking out loud here.



-- 
Andrew Welch
http://andrewjwelch.com
Kernow: http://kernowforsaxon.sf.net/


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.