[XSL-LIST Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: collection() and uncommon file extensions

Subject: Re: collection() and uncommon file extensions
From: "Michael Kay mike@xxxxxxxxxxxx" <xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx>
Date: Thu, 15 Nov 2018 20:58:53 -0000
Re:  collection() and uncommon file extensions
Everything about the collection() function is very implementation-specific, so
this is really a Saxon question rather than an XSLT question. (And no, there
are no plans to define standards in this area, though it would be nice.)

The way you are going about it looks right to me. It's probably failing
because of some detail that you didn't realise was important. I know it's
difficult to put together a repro for this kind of problem but that's really
what we need.

Around 40 years ago I worked with an operating system that knew the content
type of each file. Shame the idea didn't catch on.

Michael Kay
Saxonica



> On 15 Nov 2018, at 19:32, Martin Holmes gtxxgm-xsl-list-2@xxxxxxxxxxx
<xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx> wrote:
>
> Hi all,
>
> The recent changes to XPath
(https://www.w3.org/TR/xpath-functions-31/#func-collection) have introduced
the capability for the collection() function to retrieve non-XML documents as
well as XML documents. However, that has broken some processes I have where
XML documents with different extensions are being retrieved. For instance,
where this:
>
> collection('dir/?*.hocr')
>
> used to happily retrieve and parse HOCR files (which are actually XHTML),
Saxon now treats these files as xs:base64Binary items, and won't parse them,
even though they have XML declarations.
>
> I know that the recommended approach to dealing with this is to use a Saxon
configuration file to register the file extension -- which I presume would be
done like this:
>
> <resources>
>  <fileExtension extension="hocr" mediaType="text/xml"/>
> </resources>
>
> However, this doesn't seem to work for me -- do I have that syntax wrong?
>
> Also, the conf file approach isn't easily portable, so I'm wondering if
there are any plans to enable the media type to be specified on the
collection() function itself, or to be registered in an XSLT document
somehow?
>
> Cheers,
> Martin

Current Thread

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.