[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: SGML query: SHUNCHAR
"Rick Jelliffe" <ricko@a...> wrote: | If the file is coming in from the outside world, it is clearly a | mistake if there is a control character in the data, and an entity | manager which uses the "shunned characters" to strip out the control | char is not doing the wrong thing. (I would say that for internal | entities, it is a different matter.) What's the difference? Aren't both under the control of the entity manager, and didn't the internal entity reside in a file that came from the outside world? | The right thing to do does not come from ISO 8879": it just provides a | way to label characters with their class. IMHO, allowing NONSGML characters through via entity references can be a nasty gotcha. (Not to mention the validation fun with "smart quotes" and the like sprinkled in HTML pages by bogotic software.) It could be a problem with the existing APIs though: if they said "btw, here's a NONSGML character, you know how it came through <wink>", that could be different. | Ultimately, boils down to whatever SP does. SP does the right things per the standard. It doesn't necessarily do the right thing(s) for the consuming application (unless you integrate it and use the native API.)
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|