[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: library for parsing RTF

Subject: Re: library for parsing RTF
From: Emmanuel Bégué <eb@xxxxxxxxxx>
Date: Wed, 30 Jun 2010 09:29:27 +0200
Re:  library for parsing RTF
On Sun, Jun 27, 2010 at 11:45 PM, Andriy Gerasika
<andriy.gerasika@xxxxxxxxx> wrote:
> how about this one: http://rtf2xml.sourceforge.net/

This is very good and I've used it a lot; it tries to be very thorough
and succeeds most of the time.
Its drawbacks is that:
- it's slow (really slow for big files)
- it requires to install Python (which may or may not be a problem)
- the xml result is a little "thick"

For simpler tasks there is a simpler tool (very hard to find on Google
for some reason):

It's written in C and can be built for any platform; it's very fast.
It doesn't try to return every single detail of the source file but I
find it sufficient for most needs (it correctly identifies titles,
lists, emphasis, and tables).

And of course you can always save rtf files in an OpenOffice format,
which is native XML (zipped); this can be done in batch if need be.

Be aware that since all these tools produce a different result using
their own schema, you need to choose your parser before writing your

(Personnally and in hindsight, I'd try rtfx fist to see if it's good
enough for what you want to do; it's really the lightest approach of
all three).

- - -

This is if you need to read rtf; if you need to write rtf, then it's
quite easy to do directly from XSLT; I found this little book to be
very helpfull:


Current Thread


Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
First Name
Last Name
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.