|
[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: rss regularis(z)ation
Elliotte Rusty Harold scripsit: > > Feed the element content into > >a tag-soup parser, infer start- and end- tags to turn it into > >a tree, and strip out all the elements you don't want showing up > >in the aggregator output. Took me about two hours to code this up > >(to be fair, I did use an off-the shelf lexer for the first step). > > If you need to write your own tag soup parser, it ain't XML. That's > too much work for a job that shouldn't be necessary in the first > place. Fortunately, Java programmers don't need to write their own tag soup parsers; I did that. http://www.ccil.org/~cowan/XML/tagsoup -- It was impossible to inveigle John Cowan <jcowan@r...> Georg Wilhelm Friedrich Hegel http://www.ccil.org/~cowan Into offering the slightest apology http://www.reutershealth.com For his Phenomenology. --W. H. Auden, from "People" (1953)
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|
|||||||||

Cart








