
Re: WORD TO XML/SGML


Joe,

I've been developing tools and techniques to do this type of conversion for the last couple of years [*].  The latest manifestation of this work is in the DocBook XSL stylesheet repository (http://docbook.sf.net/) - a system for "roundtripping" DocBook via Word.  That is, there are XSL stylesheets for converting WordML (Office 2003's XML format) into DocBook, and stylesheets for the reverse.  One half of that is what you're looking for.  See also http://www.ausweb.scu.edu.au/aw04/papers/refereed/ball and http://ausweb.scu.edu.au/aw05/papers/edited/ball/poster.html

The techniques used are not specific to WordML, nor are they specific to DocBook.  I have developed stylesheets for my clients that target XML schemas/DTDs other than DocBook.

There's no single button, but it is achievable.  A big constraint is that your Word documents must be marked up using styles, or at least be "regular" or consistent in some sense.
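
To illustrate why consistent styles matter, here is a minimal sketch in Python (not the DocBook XSL stylesheets themselves, which are written in XSLT). It reads a small WordML fragment and maps each paragraph's style name to a target element. The style-to-element table, the fallback to "para", the treatment of unstyled paragraphs as "Normal", and the sample document are all assumptions made for illustration; a real conversion would also need to nest sections and handle tables, graphics and inline markup.

```python
import xml.etree.ElementTree as ET

# Namespace of Word 2003's XML format (WordML).
W = "http://schemas.microsoft.com/office/word/2003/wordml"
NS = {"w": W}

# Hypothetical mapping from Word paragraph style names to target elements.
STYLE_MAP = {"Heading1": "title", "Normal": "para"}

# Hypothetical sample input: one heading, one body paragraph split across
# two runs, and one paragraph using a style the mapping does not know.
SAMPLE = """<w:wordDocument xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml">
  <w:body>
    <w:p><w:pPr><w:pStyle w:val="Heading1"/></w:pPr><w:r><w:t>Installation</w:t></w:r></w:p>
    <w:p><w:r><w:t>Unpack the </w:t></w:r><w:r><w:t>archive.</w:t></w:r></w:p>
    <w:p><w:pPr><w:pStyle w:val="MyOddStyle"/></w:pPr><w:r><w:t>Note text</w:t></w:r></w:p>
  </w:body>
</w:wordDocument>"""


def convert(wordml_text):
    """Return (flat <article> element, list of style names with no mapping)."""
    root = ET.fromstring(wordml_text)
    article = ET.Element("article")
    unmapped = []
    for p in root.iter(f"{{{W}}}p"):
        # The paragraph style, if any, lives in w:pPr/w:pStyle/@w:val.
        style_el = p.find("w:pPr/w:pStyle", NS)
        # Assumption: a paragraph without an explicit style counts as "Normal".
        style = style_el.get(f"{{{W}}}val") if style_el is not None else "Normal"
        # Concatenate the text of every run in the paragraph.
        text = "".join(t.text or "" for t in p.iter(f"{{{W}}}t"))
        tag = STYLE_MAP.get(style)
        if tag is None:
            # Record the unknown style and fall back to a plain paragraph.
            unmapped.append(style)
            tag = "para"
        ET.SubElement(article, tag).text = text
    return article, unmapped


if __name__ == "__main__":
    article, unmapped = convert(SAMPLE)
    print(ET.tostring(article, encoding="unicode"))
    print("Styles with no mapping:", unmapped)
```

The list of unmapped styles is the useful part when sizing up a set of legacy documents: the longer it is, the more clean-up the Word files need before any automated conversion will be reliable.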

Contact me if you'd like further information.

[*] Word->XML converters have been around for even longer, but the introduction of Word 2003 has made the process much more robust - not actually much easier, but more reliable.  There are some commercial products around that may help - DocSoft is one example; there are others.

Cheers,
Steve Ball


---


Steve Ball            |   XSLT Standard Library   | Training & Seminars

Explain         |     Web Tcl Complete      |   XML XSL Schemas

http://www.explain.com.au/ |      TclXML TclDOM        | Tcl, Web Development

Steve.Ball@e...  +---------------------------+---------------------

Ph. +61 2 6242 4099   |   Mobile (0413) 594 462   | Fax +61 2 6242 4099



On 19/08/2005, at 2:54 AM, Davis, Joe wrote:

Good morning.

I've been nominated to take some legacy technical manuals written in Word and Word Perfect and convert them into an SGML/XML format.  The manuals are a combination of text, table, and graphics.   The required DTDs, etc. should be supplied to us.  We've played with Word which will convert to HTML, such as it is.  The minor research that I've done does not explain how to convert a manual over.

 

Without knowing what I'm doing, it appears as if each heading, paragraph, table/cell/row, graphic, and foldout will need to be given individual tags.

 

Where is a good source for information on conversion?

Is there a program that will make my life easier?  (Is there really an easy button?)

 

 

Any help will be gratefully received.

Thanks,

Joe

 



