XML design: why is > allowed in attribute value?

Cart

XML Editor - Download a Free Trial >

See What's New >

Buy Now >

[Home] [By Thread] [By Date] [Recent Entries]

From: Daniel Barclay <Daniel.Barclay@d...>
To: xml-dev@l...
Date: Fri, 01 Dec 2000 17:19:03 -0500


I've been wondering something about the design of XML.

Why are unencoded greater-than (">") characters allowed in attribute 
values?

I would have thought that greater-than characters inside a tag
(that is, excluding the one terminating the tag) would have been
disallowed, to make it easy for a scanner to identify the ends
of tags without having to parse attributes and their quotation 
characters.


XML does require that less-than characters in attribute values be 
encoded.

It seems that the purpose of this requirement was to make it easy to 
identify the beginnings of tags by simply finding less-than characters,
without having to keep track of whether they appear in attribute-value 
quotes and therefore don't actually signal tags.  (Yes, I'm ignoring 
comments and CDATA sections.)

Is that reasoning correct?  If so, why wouldn't greater-than characters 
be treated similarly, to similarly simplify finding the ends of tags?

(Yes, I know a full parser has to parse everything, but some applications
(e.g., syntax highlighting) might just want to identify the beginnings 
and ends of tags.)

Curiously,

Daniel
-- 
Daniel Barclay
Digital Focus
Daniel.Barclay@d...

Prev by Date: RE: XML Schemas: Best Practices
Next by Date: Re: XML Schemas: Best Practices
Previous by thread: DTD Question
Next by thread: The Mire 'twixt Documents And Data
Index(es):
- Date
- Thread

XML Editor - Download a 15 Day Free Trial Now >

See What's New in Stylus Studio >

Buy Stylus Studio - XML Editor - Now >