[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Parsing reports.

Subject: Re: Parsing reports.
From: Dave Pawson <davep@xxxxxxxxxxxxx>
Date: Mon, 26 Apr 2010 13:47:13 +0100
Re:  Parsing reports.
On Mon, 26 Apr 2010 04:50:35 -0700
"Vladimir Nesterovsky" <vladimir@xxxxxxxxxxxxxxxxxxxx> wrote:

> Hello!
> 
> I have many kinds of text reports that are output of some legacy
> programs. These are tabular reports: titles, headers, footers, pages,
> details, totals, subtotals, an so on.
> 
> My task is to transform these reports into a structured data, and
> then feed these data to
> a contemporary report engine.
> 
> I'm trying to understand now, if it's worth to approach to the task
> from the xslt side.
> Technically the task is solvable, however this would mean creation
> and support of many such 
> report parsers within xslt.
> 
> If anybody have already dealt with similar task?
> Any advices?
> Any xslt support libraries?
> Is it worth to deal with xslt here?

Depends how 'regular' the lines of text are?
XSLT can isolate lines of text and mark them up as XML.
Within each line, if the format is regular, regex can isolate
headers, footers etc. 

If the text is not regular, I'd suggest using a programming
language with good regexp support and finish manually if needed.

I've not seen any libraries that would help here.

Python or Perl would be good text handling tools 

HTH

-- 

regards 

-- 
Dave Pawson
XSLT XSL-FO FAQ.
http://www.dpawson.co.uk

Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.