RE: XSLT match with regex what's the best current solu
Hi Gunther,

as you can read in the regular expression thread
http://www.biglist.com/lists/xsl-list/archives/200201/msg00488.html
and further on, we are working on a tool which might be helpful for
your purposes. It is a mixture of regexes and an XSLT-like language,
and we have called it regexslt. It is implemented in Java, and could
serve as a pre-processor that transforms your non-XML documents into
XML, so that you can use XSLT thereafter.

It is by no means as sophisticated as Omnimark and the like, but you
are welcome to give it a try if you are interested. You'll find some
code samples in the thread I referred to, and we will announce the
first dot-oo release on this list (which will coincide with a new
release of our website).

If we can gather enough of a community around it, we would consider
making it a Sourceforge project.

Regards,

Steven Noels
http://outerthought.org/
(+32)478 292900

> -----Original Message-----
> From: owner-xsl-list@xxxxxxxxxxxxxxxxxxxxxx
> [mailto:owner-xsl-list@xxxxxxxxxxxxxxxxxxxxxx]On Behalf Of Gunther
> Schadow
> Sent: Monday, 14 January 2002 23:46
> To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
> Subject: XSLT match with regex what's the best current solution?
>
>
> Hi,
>
> [for once this has nothing to do with dynamic scoping :-)]
>
> I am working on a suite of scripts that induce structure in
> free text and eventually capture fine-grained medical information.
> I have been using AWK so far, but I am thinking about making
> this a process largely of XML transformations. However, since I
> must induce XML structure from semi-structured free text, I need
> some more parsing support. First, regular expressions. I know
> there is EXSLT, but are regex matches and replaces supported
> in SAXON? (I love SAXON, so I would prefer using it.)
>
> Also, any ideas of additional parsing tools and their integration
> into XSLT would be appreciated. Is there a way of running XSLT
> in line-mode and having every line matched against regular
> expressions? Well, I suppose so: with a simple sed script I could
> first wrap each line in a <line>...</line> tag and then use a regex
> match on the text node of each <line> element.
>
> Is SAXON easy to extend? I suppose there is some documentation
> of SAXON that tells me how to write extensions in Java, right?
> Any reason why it would be better to use something other than
> SAXON if my platform is Java and I'm not interested in Web stuff
> (in which case I would look into the Apache work)?
>
> thanks for your ideas,
> -Gunther
>
> --
> Gunther Schadow, M.D., Ph.D.
> gschadow@xxxxxxxxxxxxxxx
> Medical Information Scientist
> Regenstrief Institute for Health Care
> Adjunct Assistant Professor
> Indiana University School of Medicine
> tel:1(317)630-7960
> http://aurora.regenstrief.org
>
> XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list
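
The pre-processing step discussed in this exchange (wrap each line of free text in a <line> element, then mark up regex matches so that XSLT can take over) can be sketched roughly as follows. This is only an illustration in Python, not regexslt, SAXON or EXSLT code: the DOSE pattern, the <lines> and <dose> element names, and both function names are assumptions invented for the example.

```python
import re
from xml.sax.saxutils import escape

# Hypothetical pattern, for illustration only: a word followed by a dose in mg.
DOSE = re.compile(r"(?P<drug>[A-Za-z]+)\s+(?P<amount>\d+)\s*mg")


def wrap_lines(text):
    """Wrap every input line in a <line> element, escaping XML special characters."""
    body = "\n".join(
        "  <line>%s</line>" % escape(line) for line in text.splitlines()
    )
    return "<lines>\n%s\n</lines>" % body


def tag_doses(line):
    """Return one <line> element in which each DOSE match is wrapped in <dose>."""
    parts = []
    pos = 0
    for m in DOSE.finditer(line):
        # Text before the match is copied through, escaped.
        parts.append(escape(line[pos:m.start()]))
        # The named groups only match letters and digits, so they are
        # safe to use as attribute values without further quoting.
        parts.append(
            '<dose drug="%s" mg="%s">%s</dose>'
            % (m.group("drug"), m.group("amount"), escape(m.group(0)))
        )
        pos = m.end()
    # Remaining text after the last match.
    parts.append(escape(line[pos:]))
    return "<line>" + "".join(parts) + "</line>"


if __name__ == "__main__":
    print(wrap_lines("a < b\nc & d"))
    print(tag_doses("Took aspirin 81 mg daily"))
```

The output of such a step is well-formed XML, so it could be handed to any XSLT processor for the remaining transformations. Escaping the text before wrapping it matters, since free text may contain "<" or "&".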