[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

RE: XSL-FO Java API

Subject: RE: XSL-FO Java API
From: "Tim Meals" <tmeals@xxxxxxxxx>
Date: Tue, 28 May 2002 14:03:50 -0700
fo java
I took a look at iText, but didn't think it could access the content from a
PDF file.  I glanced quickly through the documentation, but I seem to
remember it saying that it could read a PDF file, but not access the content
of the PDF document.  I was able to open a PDF file using the API and access
the information, such as number of pages and page rotation, but didn't see a
way to access (for example) the first paragraph.

I guess a way to pseudo-transform a PDF file into a tree-like structure is
what I really want.  Ideally, from a PDF file, you would get:

<document>
   <documentinfo>
      <author>Joe Smith</author>
   </documentinfo>
   <page>
      <paragraph>
         Ipsum dolorem factum ...
      </paragraph>
      <image encoding="base64">
         MLSKDFOQJHGJHGHHH ...
      </image>
   </page>
</document>

It's basically like taking a PDF file and doing a reverse of what an XSL-FO
processor like Apache FOP does.  The resulting tree would probably be an FO
document.  Do any products do reverse transforms of PDF into FO?

Thanks,
Tim Meals
tmeals@xxxxxxxxx

-----Original Message-----
From: owner-xsl-list@xxxxxxxxxxxxxxxxxxxxxx
[mailto:owner-xsl-list@xxxxxxxxxxxxxxxxxxxxxx]On Behalf Of J.Pietschmann
Sent: Monday, May 27, 2002 12:42 PM
To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
Subject: Re:  XSL-FO Java API


Tim Meals wrote:
> Does anyone on the list have experience with various XSL-FO packages
> (Apache FOP, other commercial software) and their Java APIs?  I would
> like to access an XSL-FO document via an API tailored specifically to
> that format.  I'm thinking using DOM would be painful, as you'd have to
> parse the tree to pull apart the document.  Ideally, there should be a
> way to represent the FO document as a page, its attributes, and its
> content.

XSLFO is XML, and can be represented as DOM, or a SAX event
stream, or an unparsed charachter or binary stream.
The Apache FO processor allows you to use any of these for input.

I'm not sure what you mean by "represent the FO document as a page".
Layout and pagination are tasks of the FO processor. There is no
explicit page structure in a FO document, apart from explicit page
breaks and related constructs. There are Java libraries which
actually use pages and page content descriptions for generating
output, check out for example iText for generating PDF.

J.Pietschmann



 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list




 XSL-List info and archive:  http://www.mulberrytech.com/xsl/xsl-list


Current Thread
  • XSL-FO Java API
    • Tim Meals - Mon, 27 May 2002 12:01:38 -0400 (EDT)
      • J.Pietschmann - Mon, 27 May 2002 15:52:05 -0400 (EDT)
        • Tim Meals - Tue, 28 May 2002 17:16:09 -0400 (EDT) <=
          • bryan - Wed, 29 May 2002 04:13:46 -0400 (EDT)

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.