|
[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] RE: rss, in all its guises
>I dropped to a two stage. > 1. Java to read, check for well-formed only and save to disk >That kills non well-formed content. >> 2. RSS documents often contain escaped markup hidden inside >> CDATA sections. >Grrr :-) > Not in CDATA sections either. See Tim Brays blog. I tend to do the following: If well-formed get escaped html as a string, tidy escaped html. If escaped html is well-formed save escaped html body in same element it was gotten from, if not well-formed strip escaped html tags save resultant string to element it was gotten from. Send the whole to intermediary xslt which can output a single newsfeed format, as my architecture handles variant rss flavors, as well as other newsfeed formats. The whole rss thing really pisses me off because of the FUD that using escaped markup is in some way a sound design decision because it makes it easier for users. XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|

Cart








