[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: fault tolerant saxon:parse()

Subject: Re: fault tolerant saxon:parse()
From: "Andrew Welch" <andrew.j.welch@xxxxxxxxx>
Date: Mon, 17 Nov 2008 12:12:21 +0000
Re:  fault tolerant saxon:parse()
>> The former needs parsing if you want to process the escaped markup,
>> but if you do that with the latter you get an error
> but if you just parsed with tagsoup (or probably the others as well0 it
> would work in both cases, because in super-lax html parsing modes an &
> not followed by some letters and a semicolon parses as itself rather
> than an error.

Given this XML:

  <title>&lt;a href="foo.html"&gt;Today&lt;/a&gt;</title>
  <title>Hammersmith &amp; City</title>

and the need to process the <title> element to strip out the markup
(or some other requirement) - how would you incorporate tagsoup?

Currently I'm calling saxon:parse on the contents of the title
element, wrapped in a root node (as there's no guarantee of a single
root element):

<xsl:variable name="parsed-content"
select="saxon:parse(concat('&lt;root&gt;', saxon:parse(title),
<xsl:value-of select="$parsed-content"/>

Do I parse the entire XML using tagsoup?

Andrew Welch
Kernow: http://kernowforsaxon.sf.net/

Current Thread


Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
First Name
Last Name
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.