[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] RE: intercepting internal entities?
DuCharme, Robert writes: > Well, almost all. I couldn't find a mapping that let me output to > 7-bit ASCII (e.g. map auml, agrave, aacute, and acirc all to "a") > but I can see from the AnselInputStreamReader class that comes with > Mike Kay's GedML package (which demos his SAXON library) how to > derive my own StreamReader with my own mapping table. Look out > umlauts. Actually, for German, a-umlaut should map to 'ae', not 'a' (and o-umlaut to 'oe', and u-umlaut to 'ue'). Likewise, 'ß' should map to 'ss'. So, if you want to recode Goethe's line ... daß beide Männer recht haben möchten ... into US-ASCII, you would need to put out ... dass beide Maenner recht haben moechten ... The alternative, "das beide Manner recht haben mochten" looks very wrong to me, though I'm not a native (or even a very good) German speaker, and perhaps tastes have changed. It's interesting to note, though, that the '¨' in old or middle high German was (I think) originally just a tiny 'e' writting above the vowel by the scribe. This stuff is always harder than you expect. I don't know the rules in Finnish for example, but I'm willing to bet that they handle ASCII-fication completely differently. And, of course, once you start transliterating non-Roman characters, the real fun begins. All the best, David -- David Megginson david@m... http://www.megginson.com/ xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i... Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1 To (un)subscribe, mailto:majordomo@i... the following message; (un)subscribe xml-dev To subscribe to the digests, mailto:majordomo@i... the following message; subscribe xml-dev-digest List coordinator, Henry Rzepa (mailto:rzepa@i...)
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|