Subject: Re: XSLT 2.0 : Unicode hex notation in regular expressions
From: Pierrick Brihaye <pierrick.brihaye@xxxxxxxxxx>
Date: Thu, 12 Aug 2004 12:37:04 +0200
David Carlisle wrote:

> [\\u0600-\\u06FF]
>
>
> \\ is a literal \ so I think that matches
>  any one of characters \ u 0 6 F and all characters in the range  0 to \,
>  except that 0 is char 48 and / is char 47 so this range is empty.

OK, got it. I now know why ":" matches [\\u0600-\\u06FF]: the colon is char 58 (x3A), which lies between zero, char 48 (x30), and the backslash, char 92 (x5C).
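To make the arithmetic concrete, here is a minimal sketch of a stylesheet that prints the three code points and tests the expression against a colon and against a real Arabic letter. The template name `main` and the sample characters are my own choices for illustration; the stylesheet is meant to be run with `main` as the initial template, so no source document is needed.

```xslt
<xsl:stylesheet version="2.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  <xsl:output method="text"/>

  <!-- Hypothetical entry point, invoked as the initial template -->
  <xsl:template name="main">
    <!-- Code points of '0', ':' and '\': prints 48 58 92 -->
    <xsl:value-of select="string-to-codepoints('0:\')"/>
    <xsl:text>&#10;</xsl:text>

    <!-- The class holds the literals \ u 0 6 F plus the range 0 to \
         (48 to 92), so the colon (58) matches: prints true -->
    <xsl:value-of select="matches(':', '^[\\u0600-\\u06FF]$')"/>
    <xsl:text>&#10;</xsl:text>

    <!-- ARABIC LETTER ALEF (U+0627) is not in that class: prints false -->
    <xsl:value-of select="matches('&#x0627;', '^[\\u0600-\\u06FF]$')"/>
    <xsl:text>&#10;</xsl:text>
  </xsl:template>
</xsl:stylesheet>
```

So the expression accepts ASCII punctuation and digits while rejecting the very characters it was written for.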

> You don't need the u-notation to enter code points into regexp (and
> they don't work)

Sorry to insist: why don't they work? Aren't they supposed to?

If they are, is this a Saxon-related problem, or a more general one that would indicate that UTS #18 is still to be implemented, is irrelevant, or whatever?

How, for example, would one use a useful syntax like matches(.,'\p{Script:Arabic}+')?

> as you can just enter the characters directly

Mmmh... not always easy, because of control characters. For Arabic, see http://www.fileformat.info/info/unicode/char/0600/index.htm.

> or if
> you want an ascii representation use xml character references,
> & # x a b c ;

Indeed. <xsl:when test="matches(.,'[&#x0600;-&#x06FF;]+')">arabic</xsl:when> gives me the expected result. Thanks for the reminder!
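For completeness, a self-contained sketch of that test. The template name `main` and the two sample strings are hypothetical, and I anchor the expression with ^ and $ so that the whole string must be Arabic, which is stricter than the test above. The XML parser expands the character references before the regular expression is ever compiled, so the regex engine only sees the two literal characters U+0600 and U+06FF.

```xslt
<xsl:stylesheet version="2.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  <xsl:output method="text"/>

  <!-- Hypothetical entry point, invoked as the initial template -->
  <xsl:template name="main">
    <!-- Sample strings (my own): four Arabic letters, then plain ASCII -->
    <xsl:for-each select="('&#x0633;&#x0644;&#x0627;&#x0645;', 'salam')">
      <xsl:choose>
        <!-- Character references give a real range U+0600 to U+06FF -->
        <xsl:when test="matches(., '^[&#x0600;-&#x06FF;]+$')">arabic</xsl:when>
        <xsl:otherwise>other</xsl:otherwise>
      </xsl:choose>
      <xsl:text>&#10;</xsl:text>
    </xsl:for-each>
  </xsl:template>
</xsl:stylesheet>
```

This should print "arabic" for the first string and "other" for the second. As far as I can tell, the XML Schema regular expression syntax also defines block escapes, so '^\p{IsArabic}+$' ought to select the same U+0600 to U+06FF block, but I have not checked how each processor handles it.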

Cheers,

p.b.
