[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

RE: WordML to XML

Subject: RE: WordML to XML
From: "Touchtel" <omprakashv@xxxxxxxxxxxxxxxxx>
Date: Sat, 12 Feb 2005 16:16:40 +0530
wordml to xml
     Hi,
        I too have been trying to figure out how this could be done when you
got there first. I  knew some kind of look-ahead would be needed as in the use
of the following-sibling but didn't     know the exact means. Does the use of
priority have any specific significance and also the use of the [1]( first
child? ) everywhere.

  Cheers,
  Omprakash.V









-----Original Message-----
From: Joris Gillis [mailto:roac@xxxxxxxxxx]
Sent: Saturday, February 12, 2005 3:49 PM
To: xsl-list@xxxxxxxxxxxxxxxxxxxxxx
Subject: Re:  WordML to XML


Tempore 03:14:57, die 02/12/2005 AD, hinc in
xsl-list@xxxxxxxxxxxxxxxxxxxxxx scripsit Vasu Nanjangud
<vasdeep@xxxxxxxxx>:

> Joris, et al...
>
> My requirement is specifically to convert wordML to
> xml. i.e. strip off the "wordML" specific tags, but
> retain the "formatting instructions".
>

When you apply this template to your WordML doc:

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
version="1.0"
xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml"
exclude-result-prefixes="w">
   <xsl:output method="xml" version="1.0" encoding="UTF-8"
indent="yes"/>

<xsl:template match="w:wordDocument">
	<vasuarticletag>
		<xsl:apply-templates select=".//w:body"/>
	</vasuarticletag>
</xsl:template>

<xsl:template match="w:r ">
	<xsl:apply-templates select="(w:rPr|w:t)[1]"  mode="styling"/>
</xsl:template>

<xsl:template match="w:rPr"  mode="styling">
	<xsl:apply-templates select="*[1]" mode="styling"/>
</xsl:template>

<xsl:template match="w:u|w:b|w:i" priority="5"  mode="styling">
	<xsl:element name="{local-name()}">
		<xsl:apply-templates select="(following-sibling::*|../../w:t)[1]"
mode="styling"/>
	</xsl:element>
</xsl:template>

<xsl:template match="w:rPr/*"  mode="styling">
	<xsl:apply-templates select="(following-sibling::*|../../w:t)[1]"
mode="styling"/>
</xsl:template>

<xsl:template match="w:t"  mode="styling"><xsl:value-of
select="."/></xsl:template>

</xsl:stylesheet>



you will get this output:

<vasuarticletag>
<b>
    <i>
      <u>
          I have bold, italics and underscore
      </u>
    </i>
</b>
</vasuarticletag>


(I added modes, so the algorithm will not interfere with other templates
you might be using.)


> So, I need help in writing an xslt which will
> 1. traverse through every "w:r" block.
done.

> 2. Look for "w:rPr" tags with "w:i", "w:b" , "w:u"
> children.
possible, but not necessary.
> 3. If they exist, output <i>, <b>, <u> tags, then
> output the contents of the corresponding "w:t" block
> and then close the <i>, <b>, <u> tags.
This is not the way XSLT works. In XSLT you construct elements, which are
added to the result tree. When the XSLT processing is done, a serializer
outputs these elements as tags.
In other words, you cannot "open a tag" nor "close a tag" in XSLT. (unless
your messing with it as if it was character data, but don't do that)

While step 3 of your the algorithm, can't be used in XSLT. The templates
in my stylesheet will give output exactly as if they were following step 3.


regards,
--
Joris Gillis (http://www.ticalc.org/cgi-bin/acct-view.cgi?userid=38041)
Gaudiam omnibus traderat W3C, nec vana fides

Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.