XML Editor
Sign up for a WebBoard account Sign Up Keyword Search Search More Options... Options
Chat Rooms Chat Help Help News News Log in to WebBoard Log in Not Logged in
Conferences Close Tree View
+ Stylus Studio Feature Requests (1192)
- Stylus Studio Technical Forum (14621)
-> + Xmlconverter does not work (2)
-> + non-printing characters in Fil... (3)
-> + Including row nos while conve... (5)
-> + Getting Evaluation Copy except... (2)
-> + SS 2008 R2 Bug? (4)
-> + Java Heap Space (2)
-> + generate report from .xsl file... (3)
-> + Can't encode 0x4 in text (3)
-> - Recognize Japanese Characters (2)
-> ->Recognize Japanese Charac...
-> + converting .xsl to xml using c... (2)
-> - Uninstall doesn't clean up reg... (1)
-> + CDISC (5)
-> - creating database from XML sch... (1)
-> + error: side-by-side configurat... (3)
-> - How to convert pdf into rdf fo... (1)
-> + Apache FOP (5)
-> + XPath Query Editor 'Buggy' (3)
-> - Feature request (1)
-> + XML Convertors (2)
-> + XMLConverters version 3.2.0.0 ... (2)
-> + JVM/stylus studio abort on sav... (3)
-- Previous [961-980] [981-1000] [1001-1020] Next
+ Website Feedback (249)
+ XSLT Help and Discussion (7625)
+ XQuery Help and Discussion (2017)
+ Stylus Studio FAQs (159)
+ Stylus Studio Code Samples & Utilities (364)
+ Stylus Studio Announcements (113)
Topic  
Postnext
Jon GallegosSubject: Recognize Japanese Characters
Author: Jon Gallegos
Date: 29 Oct 2008 07:53 AM
Is there a way to recognize japanese characters?

I am getting an XML file from Japan. Some of the data is in Japanese (Kanji) some is in English. However the XML file I am to produce must be in English. I need a way to check each data field to see if it is Japanese(Kanji).

Posttop
(Deleted User) Subject: Recognize Japanese Characters
Author: (Deleted User)
Date: 30 Oct 2008 10:40 AM
Hi Jon,
do you need to check for them manually? In this case you can open the XML document in the XML Editor, press Ctrl-F to show the Find dialog, check the 'regular expression' check box and enter "[^\x00-\xFF]" (without quotes) as search expression; this will locate the next non-English character in the document.
If you need to look for Japanese characters programmatically, you can write an XQuery or XSLT 2.0 stylesheet that executes fn:matches($myString,".*(\p{IsKatakana}|\p{IsHiragana})+.*") to detect whether the given variable contains any Katakana or Hiragana symbol (look at http://www.w3.org/TR/xmlschema-2/#nt-IsBlock for the list of IsXXX strings)

Hope this helps,
Alberto

   
Download A Free Trial of Stylus Studio 6 XML Professional Edition Today! Powered by Stylus Studio, the world's leading XML IDE for XML, XSLT, XQuery, XML Schema, DTD, XPath, WSDL, XHTML, SQL/XML, and XML Mapping!  
go

Log In Options

Site Map | Privacy Policy | Terms of Use | Trademarks
Stylus Scoop XML Newsletter:
W3C Member
Stylus Studio® and DataDirect XQuery ™are from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2016 All Rights Reserved.