[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: resolve html entities

Subject: Re: resolve html entities
From: Maximilian Gärber <max@xxxxxxxxxx>
Date: Mon, 31 Oct 2005 11:29:43 +0100
html entities star
One last question:
As the html is well formed xml (xhtml), can you point me to a resource (dtd) to start with?


Thanks,

Max

David Carlisle wrote:

2.) get a fitting dtd/schema which maps these entities to unicode characters

Would either one be a good starting point?



It would have to be a dtd (schema's don't do entity definitions) This is the "standard" way of doing this so long as the "html" you are getting is well formed xml. But most html isn't even valid html never mind being well formed, in which case, as Michael said, using tag soup is a better option as it is designed to forgive at places where a browser would forgive (but an xml parser would give a fatal error)..

David


________________________________________________________________________ This e-mail has been scanned for all viruses by Star. The service is powered by MessageLabs. For more information on a proactive anti-virus service working around the clock, around the globe, visit: http://www.star.net.uk ________________________________________________________________________

Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.