[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: case-sensitivity in xml

Subject: Re: case-sensitivity in xml
From: Wendell Piez <wapiez@xxxxxxxxxxxxxxxx>
Date: Mon, 24 Jan 2005 12:02:15 -0500
xml lower case
At 06:59 PM 1/21/2005, it was written:
Wendell Piez writes:
> In general, case-folding is done with the translate function. So if
>
> <xsl:variable name="UPPER" select="'ABCDEFGHIJKLMNOPQRSTUVWXYZ'"/>
>
> <xsl:variable name="lower" select="'abcdefghijklmnopqrstuvwxyz'"/>
>
> then translate($string,$UPPER,$lower) will convert to lower case (at
least
> in the English/Latin alphabet).

English (ASCII/American) and Latin (ISO 8859-1/Western European) are not
the same.

These encodings are not the same, but I submit the alphabets are close enough to be reasonably considered the same ... of course it depends on your notion of "alphabet". :-> (Some might even take exception to the identification of the English alphabet with an American encoding standard!)

  But it's easy to include Western, Eastern, and Southern
European alphabets in your case conversion (see
http://www.unicode.org/charts/PDF/U0080.pdf
http://www.unicode.org/charts/PDF/U0100.pdf
http://www.unicode.org/charts/PDF/U0180.pdf):

<xsl:variable name="UPPER" select="...&#x00C0;&#x00C1;&#x00C2;..."/>
<xsl:variable name="lower" select="...&#x00E0;&#x00E1;&#x00E2;..."/>

Not to mention Greek and Cyrillic:

http://www.unicode.org/charts/PDF/U0370.pdf
http://www.unicode.org/charts/PDF/U0500.pdf

Well, it's easy providing you can determine a one-to-one mapping between lower-case and upper-case characters in every case.

Some alphabets present difficulties: for example what is the upper-case
version of the German "sharp s"? (Find discussion of these issues in the
archives to this list.) If the character "_" has to be converted to "SS",
the simple translate() function won't do.

Since the sharp s has to be unusual in tag names, however, I considered
such minutiae probably outside the scope of the OP's question.

Cheers,
Wendell


====================================================================== Wendell Piez mailto:wapiez@xxxxxxxxxxxxxxxx Mulberry Technologies, Inc. http://www.mulberrytech.com 17 West Jefferson Street Direct Phone: 301/315-9635 Suite 207 Phone: 301/315-9631 Rockville, MD 20850 Fax: 301/315-8285 ---------------------------------------------------------------------- Mulberry Technologies: A Consultancy Specializing in SGML and XML ======================================================================

Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.