[Home] [By Thread] [By Date] [Recent Entries]
Dylan Walsh asked - > Hi. I am working on preparing examples of HTML, created by professional web > designers, for use with XSL transformations. Obviously the markup needs to > be made well-formed for this purpose. I am familiar with Tidy from the W3C, > however that utility goes beyond what we need, as it makes changes to > conform to their standards. This unfortunately results in visual anomalies > in the output. > > I understand the arguements behind XHTML etc., however the markup we are > given is designed to work look good with older browsers, on different > platforms. Is there any software out there that converts HTML to be > well-formed XML, but does not make changes beyond that, e.g. to obey the > HTML or XHTML standards? Here's one more response (although Chris Maden's suggestion for sx is probably the best) - Python has a module called "sgmllib" which by default produces ESIS output from an HTML file. You'd have to convert the ESIS, but all the white spaces would be there. It also has an "htmllib" built on the previous one. You'd have to modify one of the formatters to output xml, but it wouldn't be too hard. Cheers, Tom Passin
|

Cart



