[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: Need an XPath 2.0 expression that identifies a lon
for $nb in ' ' return (p[. eq $nb and following-sibling::*[position() le 10][self::p]][. eq $nb])[1] On Mon, Oct 14, 2019 at 10:08 AM Costello, Roger L. costello@xxxxxxxxx < xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx> wrote: > Hi Folks, > > As you may know, when a formatted email message is created in Outlook, > Outlook generates HTML under the hood. > > I am trying to determine if a formatted email message has text at the > bottom of the email message that is separated from the rest of the email by > a lot of space. In other words, the text at the bottom of the underlying > HTML is preceded by a bunch of non-blocking space characters ( ). > > Assume the HTML has been converted to XHTML. > > I need an XPath 2.0 expression that identifies a long block of > non-blocking space characters. > > Outlook generates HTML like that shown below. The non-blocking space > character is nested inside an <o:p> element, which is nested inside a <p> > element. > > I came up with this XPath expression: > > //p[o:p eq ' '][count(following-sibling::*[position() le 10][name() > eq 'p'][o:p eq ' ']) ge 10][1] > > It says, "Give me the first <p> element containing a non-blocking space > character such that there are at least 10 <p> elements that immediately > follow it, each containing a non-blocking space character." At least, > that's what I think it says. Note: 10 is an arbitrary number. > > Questions: > 1. Do you see any problems with the XPath expression? > 2. Is there a better XPath expression? > > <html xmlns:o="urn:schemas-microsoft-com:office:office"> > <p class="MsoNormal">top text<o:p/></p> > <p class="MsoNormal"><o:p> </o:p></p> > <p class="MsoNormal"><o:p> </o:p></p> > <p class="MsoNormal"><o:p> </o:p></p> > <p class="MsoNormal"><o:p> </o:p></p> > <p class="MsoNormal"><o:p> </o:p></p> > <p class="MsoNormal"><o:p> </o:p></p> > <p class="MsoNormal"><o:p> </o:p></p> > <p class="MsoNormal"><o:p> </o:p></p> > <p class="MsoNormal"><o:p> </o:p></p> > <p class="MsoNormal"><o:p> </o:p></p> > <p class="MsoNormal"><o:p> </o:p></p> > <p class="MsoNormal">bottom text<o:p/></p> > </html> > > -- Cheers, Dimitre Novatchev --------------------------------------- Truly great madness cannot be achieved without significant intelligence. --------------------------------------- To invent, you need a good imagination and a pile of junk ------------------------------------- Never fight an inanimate object ------------------------------------- To avoid situations in which you might make mistakes may be the biggest mistake of all ------------------------------------ Quality means doing it right when no one is looking. ------------------------------------- You've achieved success in your field when you don't know whether what you're doing is work or play ------------------------------------- To achieve the impossible dream, try going to sleep. ------------------------------------- Facts do not cease to exist because they are ignored. ------------------------------------- Typing monkeys will write all Shakespeare's works in 200yrs.Will they write all patents, too? :) ------------------------------------- Sanity is madness put to good use. ------------------------------------- I finally figured out the only reason to be alive is to enjoy it.
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|