[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Confused about & in entity literal

  • From: roddey@u...
  • To: xml-dev@i...
  • Date: Mon, 10 May 1999 11:27:39 -0600

xml ampersand literal



>> In this one, there is definitely an ampersand in an entity literal which is
not
>> part of a numeric character reference or an intrinsic character reference.
The
>> spec does not seem to day "No raw & in an entity value unless its a numeric
ref
>> or intrinsic ref, or some other reference that's just left unexpanded",
right?
>> It just says that there can be no ampersands in an entity value unless its
part
>> of a numeric reference or an intrinsic reference.
>
>Have a closer look at production 9:
>
>    [9] EntityValue ::=
>         '"' ([^%&"] | PEReference | Reference)* '"' |
>         "'" ([^%&'] | PEReference | Reference)* "'"
>
>Which _does_ say that you can't have a raw '&' in an entity value etc.
>That's what the excslusion syntax means.

So its like you have to parse the entity value and, if you find an ampersand,
you have to parse it like an entity reference. If it happens to be either a
numeric reference or the name happens to match one of the intrinsic entity
names, then you should expand that and escape the character it generates?
Otherwise, if it happens to look like a reasonable entity reference, I guess you
are supposed to ignore it and just pass it through as is? If it does not look
like a reasonable entity reference, then you give an error?

What about this scenario?

<!ENTITY Foo "&[insert 128K of what is really the rest of a base64 encoded or
encrypted piece of text];">

In this scenario, the entity is an encoded value of some sort, which just
happens to start with an ampersand and end with a semi-colon. It has no illegal
name chars in it and no spaces. You are supposed to buffer up all of that text,
and if it happens to end with a semicolon, assume that it is a legal reference
and pass it through?

Yes I know that the ampersands should have been escaped technically, but how
many parsers would blow up in this situation trying to buffer up that much text?
How would the end user of that text figure out again where the escaped
ampersands are in the text since its basically a totally meaningless sequence of
characters to begin with?

I know that these are pathological cases, but I just want to make sure that I
understand what's required and that everyone has throught it through before I
commit (yet again) to writing this part of the code, since I obviously flubbed
it slightly the first time around.



xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i...
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/ and on CD-ROM/ISBN 981-02-3594-1
To (un)subscribe, mailto:majordomo@i... the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@i... the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@i...)


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.