Re: XML's Scylla and Charybdis - parse and regexp
jonathan.robie@d... (Jonathan Robie) writes: >I wasn't so much asking whether faithful lexical round-tripping is an >important goal for me, I think I know the answer to that question. I >was more interested in asking why this is important to you, and which >problems you are solving for which such distinctions are significant. Most of my day-job use of XML is heavily dependent on entity processing, both for chapters and for special characters. Preserving the original without normalizing the document is useful in such cases, to say the least. CDATA section handling/ignoring has become an issue for me in the last week, thanks to a fairly broken set of assumptions made by a database in its exporting XML. That gets especially interesting when combined with cases where CDATA sections are used intentionally and where you want to avoid processing entities. Whitespace in tags, quote styling on attributes, and other similar features are less of a priority to me, though they can be very useful for marking content that's to be changed using text-based tools without affecting consuming applications. I don't need them as much personally, but as long as I'm going to the effort, I might as well preserve them and save someone else the effort later. -- Simon St.Laurent Ring around the content, a pocket full of brackets Errors, errors, all fall down! http://simonstl.com -- http://monasticxml.org
PURCHASE STYLUS STUDIO ONLINE TODAY!
Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!
Download The World's Best XML IDE!
Accelerate XML development with our award-winning XML IDE - Download a free trial today!
Subscribe in XML format