[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: XML CDATA sections ... the good, the bad, and the ugly

  • From: David Carlisle <d.p.carlisle@gmail.com>
  • To: "Costello, Roger L." <costello@mitre.org>
  • Date: Fri, 26 Jun 2015 14:01:08 +0100

Re:  XML CDATA sections ... the good
On 26 June 2015 at 13:52, Costello, Roger L. <costello@mitre.org> wrote:
> Hi Folks,
>
>
>
> The CDATA section is a mechanism for disabling the normal interpretation of
> XML syntax. For example, the title element in this XML document contains a
> script element:
>
>
>
> <title>
>     <script>...</script>
> </title>
>
>
>
> The title element in this XML document contains a string:
>
>
>
> <title>
>     <![CDATA[<script>...</script>]]>
> </title>
>
>
>
> Ordinarily <script> would be interpreted as a start tag but since <script>
> is embedded within a CDATA section the normal interpretation is disabled and
> <script> is treated simply as a string.
>
>
>
> There is nothing that XSLT programs can do about CDATA sections. That is
> because the XML parser removes the CDATA wrapper and creates a text node for
> its content. The CDATA wrapper is gone by the time the XSLT program gets the
> XML; all that remains is a text node.
>
>
>
> Nonetheless, some XSLT processors can be configured to instruct its XML
> parser to remove the CDATA sections. The approach for doing this varies with
> the XSLT processor. This article describes the approach used by one XSLT
> processor: http://sourceforge.net/p/saxon/mailman/message/34240016/.
>
>
>
> Any errors in this description? Anything you would add or delete?
>
>
>
> /Roger
>
>



No real errors but I'd add a strong warning that if you are needing to
do this then probably you should not be using an XML syntax.

Clearly it is possible to infer different meaning between

<x><![CDATA[zzz]]/x>

and

<x>zzz</x>

Just as it is between

<x    >zzz</x>

and

<x>zzz</x>

However XML processors are supposed to treat these as the same input
and if they start to treat them differently then that means that
producers of XML lose confidence in the format: things that are
supposed to be cosmetic stylistic choices in the output may have
unknown effects in future unknown readers of the document.

David


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.