|
[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: XSLT script to report Unicode characters and code
Colin
> Yes. XML Schema (and hence XPath) regular expressions.
They don't help do they?
Take alpha U+0391. The UCD says that is Lu so it matches \p(Lu) but that
just tells you it's a lower case letter, it doesn't tell you it's in the
block
<block start="00370" end="003FF" name="Greek and Coptic"/>
does it? The code I pointed to in the message you replied to would take
an alpha, get its code point, and find the string "0039" as being the
first four digits of a five digit hex representation of the codepoint,
then find this block element in unicode.xml, and thus (for example) to
http://www.unicode.org/charts/PDF/U0370.pdf
which is the pdf file which has the alpha glyph example.
Actually regexp could help, you could take the block range information
and build a regexp that matches each block by generating teh required
charater range expressions, but I think it's more natural to do that as
an xpath query rather than forcing it through the regexp engine.
David
________________________________________________________________________
The Numerical Algorithms Group Ltd is a company registered in England
and Wales with company number 1249803. The registered office is:
Wilkinson House, Jordan Hill Road, Oxford OX2 8DR, United Kingdom.
This e-mail has been scanned for all viruses by Star. The service is
powered by MessageLabs.
________________________________________________________________________
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|






