[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

RE: java Regex call

Subject: RE: java Regex call
From: David.Pawson@xxxxxxxxxxx
Date: Thu, 10 Jul 2003 11:31:47 +0100
java regex groups
Just curious Jeni,
Using: 

<xsl:output method="xml" indent="yes"/>
  <xsl:variable name="regex" select="'(([^_]*)_PARA)'"/>
  <xsl:variable name="input" select="'ABC_PARA__PARA'"/>
  <xsl:template match="/">
   <xsl:analyze-string select="$input" regex="{$regex}">
      <xsl:matching-substring>
        <RULE_MATCH>
          Input    <xsl:value-of select="$input" />
        </RULE_MATCH>
        <REGEX><xsl:copy-of select="$regex" /></REGEX>
        <GROUP1><xsl:value-of select="regex-group(1)" /></GROUP1>
        <GROUP2><xsl:value-of select="regex-group(2)" /></GROUP2>
        <GROUP3><xsl:value-of select="regex-group(3)" /></GROUP3>
.
.
      </xsl:matching-substring>
      <xsl:non-matching-substring>
        <mismatch><xsl:value-of select="."/></mismatch>
      </xsl:non-matching-substring>
    </xsl:analyze-string>

  </xsl:template>

produces

<?xml version="1.0" encoding="UTF-8"?>
<RULE_MATCH>
          Input    ABC_PARA__PARA</RULE_MATCH>
<REGEX>(([^_]*)_PARA)</REGEX>
<GROUP1>ABC_PARA</GROUP1>
<GROUP2>ABC</GROUP2>
<GROUP3/>
.
.
      <mismatch>_</mismatch>
<RULE_MATCH>
          Input    ABC_PARA__PARA</RULE_MATCH>
<REGEX>(([^_]*)_PARA)</REGEX>
<GROUP1>_PARA</GROUP1>
<GROUP2/>
<GROUP3/>
.
.

Why are there two lots of output please? 

regards DaveP


> -----Original Message-----
> From: Jeni Tennison 
> If you're using Saxon 7.6, why not use the built-in regular expression
> support rather than Java extensions?
> 
> > Example: if my regex is defined as:-
> >
> >  (([^_]*)_PARA)|((.*?)(PARA)(.*?))
> >
> >  and my input is "ABC_PARA"
> >
> >  Then I need to know what portions of the input matched
> >  each (if any) part of the regex groups, ([^_]*) and (.*?) etc,.
> >  in terms of group number and matching string.
> 
> If you use <xsl:analyze-string> then within the
> <xsl:matching-substring>, the regex-group() function gives you access
> to the values of the regex groups. Try:
> 
>   <xsl:for-each select="$rule//Type">
>     <xsl:variable name="tdlType" select="." />
>     <xsl:analyze-string select="." regex="{$regex}">
>       <xsl:matching-substring>
>         <RULE_MATCH>
>           <xsl:value-of select="$rule//@name" />
>         </RULE_MATCH>
>         <REGEX><xsl:copy-of select="$regex" /></REGEX>
>         <GROUP1><xsl:value-of select="regex-group(1)" /></GROUP1>
>         <GROUP2><xsl:value-of select="regex-group(2)" /></GROUP2>
>         <GROUP3><xsl:value-of select="regex-group(3)" /></GROUP3>
>         ...
>       </xsl:matching-substring>
>     </xsl:analyze-string>
>   </xsl:for-each>
> 
> Of course you need to know what regular expression is being used to be
> able to tell which regex groups you're actually interested in...
> 
> By the way, I'm not sure why you're using "//" in $rule//@name -- this
> will return all the name attributes of all the elements that are
> descendants of $rule, as well as of $rule itself, and return them in a
> whitespace-separated list (this being XSLT 2.0). Is that what you
> really want?
> 
> Cheers,
> 
> Jeni
> 
> ---
> Jeni Tennison
> http://www.jenitennison.com/
> 
> 
>  XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list
> 

- 

NOTICE: The information contained in this email and any attachments is 
confidential and may be legally privileged. If you are not the 
intended recipient you are hereby notified that you must not use, 
disclose, distribute, copy, print or rely on this email's content. If 
you are not the intended recipient, please notify the sender 
immediately and then delete the email and any attachments from your 
system.

RNIB has made strenuous efforts to ensure that emails and any 
attachments generated by its staff are free from viruses. However, it 
cannot accept any responsibility for any viruses which are 
transmitted. We therefore recommend you scan all attachments.

Please note that the statements and views expressed in this email 
and any attachments are those of the author and do not necessarily 
represent those of RNIB.

RNIB Registered Charity Number: 226227

Website: http://www.rnib.org.uk 

 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.