RE: Re: text() word lists
> > Not that I understand it,
> but ( and ) seem to be included Michael?
> <word>) - 71</word>
> <word>(this - 11</word>
>
> Is it modify by updating
> for $w in tokenize(string(.), '[\s.?!,]+')[.] return
> line?
>
> for $w in tokenize(string(.), '[\s.?!, )(]+')[.] return
> seems to work.

I only spent five minutes on this: producing a decent natural language tokenizer takes a little bit longer than that! Obviously it's easy to write a more intelligent regex; I was only trying to illustrate the principles.

Michael Kay

XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
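
For readers following the thread, here is a minimal sketch of how the extended separator class from the quoted line might sit in a complete XSLT 2.0 word-count stylesheet. It is not the stylesheet posted earlier in the thread: the `words` wrapper element, the grouping, and the sort order are assumptions made for illustration, and only the `tokenize(...)` expression and the `<word>text - count</word>` output shape come from the messages above.

```xslt
<xsl:stylesheet version="2.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <xsl:output method="xml" indent="yes"/>

  <xsl:template match="/">
    <!-- "words" is a hypothetical wrapper element, not taken from the thread -->
    <words>
      <!-- Split the document's string value on runs of whitespace,
           punctuation and parentheses. The trailing [.] predicate drops
           zero-length tokens, which tokenize() returns when the text
           starts or ends with a separator. -->
      <xsl:for-each-group
          select="for $w in tokenize(string(.), '[\s.?!, )(]+')[.] return $w"
          group-by=".">
        <!-- Most frequent words first (an assumed ordering) -->
        <xsl:sort select="count(current-group())" order="descending"/>
        <word>
          <xsl:value-of select="current-grouping-key(), '-', count(current-group())"/>
        </word>
      </xsl:for-each-group>
    </words>
  </xsl:template>

</xsl:stylesheet>
```

With `(` and `)` inside the character class, a token such as `(this` is split to `this`, and a lone `)` no longer appears as a word. Quotes, semicolons, colons and other punctuation would still need to be added to the class, which is the point made above about a real tokenizer taking more work.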