RE: Unicode character blocks in strings
Try:

  <xsl:analyze-string regex="\p{{IsCJKUnifiedIdeographs}}">
    <xsl:matching-substring>
      <out><xsl:value-of select="."/></out>
    </xsl:matching-substring>
    <xsl:non-matching-substring>
      <out><xsl:value-of select="."/></out>
    </xsl:non-matching-substring>
  </xsl:analyze-string>

Regards,

Michael Kay
http://www.saxonica.com/
http://twitter.com/michaelhkay

> -----Original Message-----
> From: tom tom [mailto:tomxsllist@xxxxxxxxxxx]
> Sent: 26 May 2009 14:08
> To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
> Subject: Unicode character blocks in strings
>
> I have a string containing a mix of Chinese and Latin
> characters, eg 0"8yM"<WPMH1N1Aw8PH7.
> I wish to return a nodeset containing the following kind of structure:
>
> 0"8yM"<WPM
> H1N1
> Aw8PH7
>
> Where H1N1 falls into the BasicLatin unicode character block
> and the other two strings can be categorized as CJKUnifiedIdeographs.
>
> Can anyone suggest the cleanest way to do this using XSLT 2?
>
> Tom
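
For anyone wanting to run the suggestion above, here is a minimal sketch of a complete XSLT 2.0 stylesheet built around it. Several things in it are assumptions rather than part of the original reply: the input element name `text`, the wrapper element `result`, and the `block` attribute are hypothetical names chosen for illustration. The sketch also supplies the `select` attribute, which `xsl:analyze-string` requires, and appends `+` to the regex so that a run of consecutive ideographs comes out as one `<out>` element rather than one element per character. The doubled braces are needed because `regex` is an attribute value template.

```xslt
<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version="2.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <xsl:output method="xml" indent="yes"/>

  <!-- Assumption: the mixed string is the content of a <text> element
       (hypothetical name, not from the original thread). -->
  <xsl:template match="text">
    <result>
      <!-- select is required; {{ and }} escape the braces because regex
           is an attribute value template. The trailing + groups each run
           of consecutive CJK ideographs into a single match. -->
      <xsl:analyze-string select="."
                          regex="\p{{IsCJKUnifiedIdeographs}}+">
        <xsl:matching-substring>
          <!-- A run made up only of CJKUnifiedIdeographs characters -->
          <out block="CJKUnifiedIdeographs"><xsl:value-of select="."/></out>
        </xsl:matching-substring>
        <xsl:non-matching-substring>
          <!-- Everything else: BasicLatin in the example, but also any
               other block that happens to appear between ideographs -->
          <out block="other"><xsl:value-of select="."/></out>
        </xsl:non-matching-substring>
      </xsl:analyze-string>
    </result>
  </xsl:template>

</xsl:stylesheet>
```

Given an input document such as `<text>中文H1N1病毒</text>` (an illustrative string, not the one from the question), this should yield three `<out>` elements holding `中文`, `H1N1` and `病毒` in that order. If the non-matching parts may contain more than BasicLatin, a second `xsl:analyze-string` with `\p{{IsBasicLatin}}+` inside `xsl:non-matching-substring` would be one way to split them further.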