RE: [xsl] first question to the list: contains

Cart

XML Editor - Download a Free Trial >

See What's New >

Buy Now >

[Home] [By Thread] [By Date] [Recent Entries]

Subject: RE: first question to the list: contains
From: "Michael Kay" <mike@xxxxxxxxxxxx>
Date: Sat, 3 Nov 2007 08:49:51 -0000

> Scott, thanks for the great explanation.  Though it's 
> disappointing (as I need to check for _any_ Japanese 
> character to make this test effective), at least it makes sense. 

Scott's explanation is correct: you can't distinguish between a character
represented natively, and the same character represented as an entity
reference. And in your application, you shouldn't, because you really don't
want to constrain the document creator/sender to use one form rather than
the other.

XSLT 2.0 has good facilities for this. There's a function
string-to-codepoints which allows you to convert a string into a sequence of
integers representing the Unicode codepoints; or you can use regular
expressions which include constructs to match particular character
categories - 12360 is in the Hiragana block which is matched by
\p{IsHiragana}.

Michael Kay
http://www.saxonica.com/

Current Thread
first question to the list: contains Jared Stein - 2 Nov 2007 22:23:44 -0000 B. Kamer - 2 Nov 2007 22:30:39 -0000 Scott Trenda - 2 Nov 2007 22:40:47 -0000 <Possible follow-ups> Jared Stein - 3 Nov 2007 05:23:20 -0000 Michael Kay - 3 Nov 2007 08:50:27 -0000 <=

<- Previous	Index	Next ->
RE: first question to the lis, Jared Stein	Thread	Extract footnotes, J. S. Rawat
Extract footnotes, J. S. Rawat	Date	Re: Extract footnotes, G. Ken Holman
	Month

XML Editor - Download a 15 Day Free Trial Now >

See What's New in Stylus Studio >

Buy Stylus Studio - XML Editor - Now >