[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: rss regularis(z)ation


rss renderer
bryan wrote:

> One of the things I would want to use namespaces for is to return
> namespaced html instead of as you pointed out " the bizarre practice of
> CDATA-escaping random HTML-ish text " but this is only starting to be
> done now, why was it not done in earlier versions? What were the excuses
> for the bizarre practice

I agree that it's bizarre and offensive, but these people are not 
completely nuts.  Think of it from the point of view of the aggregator 
writer.  They want to parse an RSS feed as XML, and they want to parse 
each entry to get the <title> and <author> and <link> and so on.  Then 
they get to the content.  They have an HTML renderer which will render 
this prettily.  So they want to take all the bytes between <content> and
</content> (those are atom tags, not RSS tags, but same difference), and 
hand them to the HTML renderer.  They don't want to parse them, because 
they'd just be doing a no-op and putting them back to together again to 
hand them to the renderer.

On the producer's side, a lot of the authoring tools give authors a lot 
of freedom in whatever editing tool they like, and to enforce that this 
be XHTML is a lot of extra work that's not done yet.

So both the producers *and* the consumers are happier using this 
horrible escaped-HTML stuff.  I and several others have told them that 
they shouldn't want to do this, but it doesn't seem to work.

As several others have pointed out, if the content were well-formed they 
could do XPath magic, and filter out dangerous things like <script>, and 
bask in the glow of karmic goodness.  In response they say "I don't want 
to do XPath magic, and my HTML renderer has a safe-sandbox mode, and I 
just want the stuff I care about (<title>, <link>, remember) in XML and 
the rest is a bag of bits, so extend me no markup.

Realistically, I think we're stuck with it.  At least Atom will *let* 
you make the content well-formed.  Then evolution takes over.
-- 
Cheers, Tim Bray
         (ongoing fragmented essay: http://www.tbray.org/ongoing/)



PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.