Re: Processing two documents, which order?

Subject: Re: Processing two documents, which order?
From: Brandon Ibach <brandon.ibach@xxxxxxxxxxxxxxxxxxx>
Date: Thu, 7 Apr 2011 10:47:29 -0400
You'll probably need to run some tests to verify performance of
various approaches, but my hunch would be to combine the list of words
into a single regex, let the regex implementation optimize it and do a
single pass over the document.
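
A minimal sketch of that idea in XSLT 2.0, not a tested solution. The file name words.xml, the <word> element name and the variable names are assumptions, since the original post does not show the layout of document (1). The regex is built once in a global variable, every word is escaped so that regex metacharacters in the list cannot break the pattern, and the delimiter that follows a match is written back out (this assumes it should be preserved, which the original template does not do).

```xml
<xsl:stylesheet version="2.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs">

  <!-- Assumption: document (1) is words.xml and holds each word in a <word> element -->
  <xsl:variable name="words" as="xs:string*"
      select="doc('words.xml')//word/normalize-space(.)"/>

  <!-- Built once: (word1|word2|...) followed by whitespace or punctuation.
       replace() backslash-escapes any regex metacharacter inside a word. -->
  <xsl:variable name="word-regex" as="xs:string"
      select="concat('(',
                string-join(for $w in $words
                            return replace($w, '[\.\\\?\*\+\{\}\(\)\|\[\]\^\$\-]', '\\$0'),
                            '|'),
                ')([\s\p{P}])')"/>

  <!-- Identity copy for everything except text nodes -->
  <xsl:template match="@*|node()">
    <xsl:copy>
      <xsl:apply-templates select="@*|node()"/>
    </xsl:copy>
  </xsl:template>

  <!-- Explicit priority: text() and node() share the same default priority -->
  <xsl:template match="text()" priority="1">
    <xsl:analyze-string select="." regex="{$word-regex}">
      <xsl:matching-substring>
        <property><xsl:value-of select="regex-group(1)"/></property>
        <!-- Re-emit the delimiter that the regex consumed -->
        <xsl:value-of select="regex-group(2)"/>
      </xsl:matching-substring>
      <xsl:non-matching-substring>
        <xsl:value-of select="."/>
      </xsl:non-matching-substring>
    </xsl:analyze-string>
  </xsl:template>

</xsl:stylesheet>
```

Like the original template, this has no leading word boundary, so a listed word can also match at the end of a longer word, and a word at the very end of a text node with no trailing delimiter is not matched. Both would need handling before relying on the output.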

Based on Michael's comments, you probably want to build the regex in a
global variable or, if Saxon doesn't recognize that this would mean
the regex is the same every time through, dynamically build the final
transform that will actually process the document (2) in a separate

-Brandon :)

On Thu, Apr 7, 2011 at 9:25 AM, Dave Pawson <davep@xxxxxxxxxxxxx> wrote:
> I have two xml documents.
> The first is a list of marked up words (1),
> the second a 'normal' xml document (2)
> For each occurrence in 2 of a word from 1
> I need to mark up the word with <property> </property>
> Which order is anywhere near optimum?
> Document 1 has about 300 words,
> Document 2 is 33,000 lines.
> This is the template to do the work
> <xsl:template match="*">
>    <xsl:param name="property" as="xs:string"/>
>    <xsl:analyze-string select="." regex="({$property})[\s\p{{P}}]">
>      <xsl:matching-substring>
> <!--    <xsl:message>match on [<xsl:value-of select='regex-group(1)'/>]</xsl:message> -->
>        <property><xsl:value-of select="regex-group(1)"/></property>
>      </xsl:matching-substring>
>      <xsl:non-matching-substring>
>        <xsl:copy-of select="."/>
>      </xsl:non-matching-substring>
>    </xsl:analyze-string>
>  </xsl:template>
> but I'm hesitating as to which loop sequence will work best?
> --
> regards
> --
> Dave Pawson
> http://www.dpawson.co.uk
