RE: C1 characters in XML 1.0 and HTML 4

  • From: "Waters, Michael, Springer US" <Mike.Waters@springer.com>
  • To: "Michael Kay" <mike@saxonica.com>,<xml-dev@l...>
  • Date: Sat, 12 Mar 2011 20:07:59 -0500

>Occasionally the internationalization working group in W3C decides to flex its muscles,
>and one instance of this was their insistence that XSLT should not generate HTML
>that contains characters which HTML defines to be illegal.

Seems very reasonable to me. Until Bjoern reminded me, I forgot about the SGML declaration for HTML 4.

>It's probably a mistake that XML allowed these C1 characters, because they are
>nearly always miscoded CP1252 characters. XML 1.1 tried to fix this problem
>but we all know what happened to that.

Yes, indeed. We've tried to avoid the complications of handling XML 1.1 in our tool chain.
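
For illustration, here is a minimal sketch of how stray C1 characters (U+0080 to U+009F) could be mapped back to the CP1252 characters they most likely stand for. It assumes the miscoding really was CP1252 bytes read as Latin-1, and the name `repair_c1` is hypothetical, not part of any tool mentioned in this thread.

```python
import re

# Matches any character in the C1 control range U+0080..U+009F.
C1_RE = re.compile(r"[\u0080-\u009f]")

def repair_c1(text: str) -> str:
    """Replace C1 characters with their CP1252 interpretation where one exists.

    Assumption: the C1 characters are CP1252 bytes that were decoded as
    Latin-1. Bytes that Python's cp1252 codec leaves undefined are kept as-is.
    """
    def fix(match: re.Match) -> str:
        ch = match.group(0)
        try:
            # Latin-1 maps U+0080..U+00FF straight back to the original byte.
            return ch.encode("latin-1").decode("cp1252")
        except UnicodeDecodeError:
            # No CP1252 mapping for this byte in Python's codec; leave it alone.
            return ch
    return C1_RE.sub(fix, text)

if __name__ == "__main__":
    # U+0093 and U+0094 become left and right curly double quotes.
    print(repair_c1("\u0093quoted\u0094"))
```

Whether a silent repair like this is appropriate depends on the content; logging each substitution is probably safer than rewriting without a trace.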

>In the meantime, the result is that you feed a bad character
>into the start of your processing pipeline and you discover
>the problem at the final stage when HTML emerges.

I was just a bit surprised that the error was caught so far down the line.
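
One way to surface the problem earlier would be a check on the decoded input before it enters the pipeline. The sketch below is only an assumption about how such a check might look; `find_c1` is a hypothetical helper, and it reports positions rather than rejecting the document.

```python
def find_c1(text: str) -> list[tuple[int, int, str]]:
    """Return (line, column, code point) for every C1 character in text.

    Lines are split on "\n" only, because str.splitlines() would itself
    treat U+0085 (a C1 character) as a line break and hide it.
    """
    hits = []
    for lineno, line in enumerate(text.split("\n"), start=1):
        for col, ch in enumerate(line, start=1):
            if 0x80 <= ord(ch) <= 0x9F:
                hits.append((lineno, col, f"U+{ord(ch):04X}"))
    return hits

if __name__ == "__main__":
    sample = "<p>fine</p>\n<p>it\u0092s broken</p>"
    for lineno, col, cp in find_c1(sample):
        print(f"line {lineno}, column {col}: C1 character {cp}")
```

Run against the source documents, a report like this points at the offending line and column long before any HTML is serialized.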
 
>The reasoning of course is that the end user shouldn't pay the price
>for the content provider's carelessness.

>This is very different from the culture in W3C which tries
>to improve data quality by insisting that software should
>reject bad data.

I'm usually on the delivery side of things, so I'm always working to understand the content and prevent bad data from getting out there in the first place.

Many thanks, Dr. Kay.

Regards,
Mike