[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: Minimal set of rules for making HTML well-formed?
On Fri, 11 Oct 2019 at 17:22, Costello, Roger L. <costello@mitre.org> wrote:
That looks like user error, with sgml (or xml) you would need to use a catalogue to supply a dtd that defines & nbsp; to be & #160;
This is a black hole that you might not want to approach see any of the 10000s of online discussions surrounding :-) The main problem that killed the polyglot idea is that making an html document that parses as well formed xhtml isn't enough in practice (you also want it to mean/render the same) but the requirement used in that document that the DOM trees resulting from an html or xml parse are the same, is very hard to achieve. 1 and 2 are rather hard to achieve without using an html parser on the front end (and if you have one of those, just dumping a serialisation of its dom tree will give xml (er except when it doesn't:-) the html parse _always_ gives a well defined result so just knowing what are the tags isn't easy with an arbitrary tool not using the html parser, and the tag names may not be valid xml names I just typed in this at random and it shows for example an attribute with name <kk that can't be expressed in xml. You might be tempted to say that's not valid input but again defining (and automatically checking ) what is valid really needs an html parser. 3) again yes so long as you know what the elements are and all the special rules around <script> etc. 4) of the characters that you mention only < and & need to be escaped in character data, ' and " do not, but you also need to escape any use of ]]> in character data, and use of control characters which are not allowed in xml 1.0 (actually you can't just escape the control characters, you need to remove them or encode them in some other way) David
[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] |
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|