RE: Re: text() word lists
> Not that I understand it,
> but ( and ) seem to be included Michael?
> <word>) - 71</word>
> <word>(this - 11</word>
>
> Is it modify by updating
> for $w in tokenize(string(.), '[\s.?!,]+')[.] return
> line?
>
> for $w in tokenize(string(.), '[\s.?!, )(]+')[.] return
> seems to work.

I only spent five minutes on this: producing a decent natural language tokenizer takes a little bit longer than that! Obviously it's easy to write a more intelligent regex; I was only trying to illustrate the principles.

Michael Kay

XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
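For readers following the thread, here is a minimal sketch of how the modified tokenizer might sit inside a complete XSLT 2.0 stylesheet. Only the `tokenize(...)[.]` expression and the `<word>token - count</word>` output shape come from the messages above. The `<words>` wrapper, the grouping with `xsl:for-each-group`, the `lower-case()` call and the sort order are assumptions made for illustration, not the stylesheet originally posted.

```xslt
<xsl:stylesheet version="2.0"
    xmlns:xsl="http://www.w3.org/1999/XSL/Transform">

  <xsl:output method="xml" indent="yes"/>

  <xsl:template match="/">
    <!-- <words> is a hypothetical wrapper element -->
    <words>
      <!-- Split the document's text on whitespace, basic punctuation and
           parentheses. The [.] predicate drops the empty tokens that
           tokenize() returns when the text starts or ends with a separator.
           lower-case() is an assumption, so that "The" and "the" group together. -->
      <xsl:for-each-group
          select="for $w in tokenize(string(.), '[\s.?!,()]+')[.]
                  return lower-case($w)"
          group-by=".">
        <!-- Most frequent words first (assumed ordering) -->
        <xsl:sort select="count(current-group())" order="descending"/>
        <word>
          <xsl:value-of select="current-grouping-key()"/>
          <xsl:text> - </xsl:text>
          <xsl:value-of select="count(current-group())"/>
        </word>
      </xsl:for-each-group>
    </words>
  </xsl:template>

</xsl:stylesheet>
```

Inside a character class the parentheses need no escaping, so `'[\s.?!,()]+'` should match the same separators as the `'[\s.?!, )(]+'` variant quoted above (the literal space there is already covered by `\s`). Other punctuation such as quotes, colons or semicolons would still end up attached to words, which is the kind of gap a more careful tokenizer would have to close.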