2.4 Character Data and Markup

Text consists of intermingled character data and markup. Markup takes the form of start-tags, end-tags, empty-element tags, entity references, character references, comments, CDATA section delimiters, document type declarations, processing instructions, XML declarations, text declarations, and any white space that is at the top level of the document entity (that is, outside the document element and not inside any other markup).

All text that is not markup constitutes the character data of the document.

The ampersand character (&) and the left angle bracket (<) MUST NOT appear
in their literal form, except when used as markup delimiters, or
within a Comment, a Processing instruction, or a CDATA Section.
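If they are needed elsewhere, they MUST be escaped
using either character references
or the strings "&amp;" and "&lt;" respectively. The right angle bracket (>) MAY be represented using the string "&gt;", and MUST, for compatibility, be escaped using either "&gt;" or a character reference when it appears in the string "]]>" in content, when that string is not marking the end of a CDATA section.

In the content of elements, character data is any string of characters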
which does not contain the start-delimiter of any markup or the
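CDATA-section-close delimiter,
"]]>". In a CDATA section, character data is any string of characters not including the CDATA-section-close delimiter, "]]>".

To allow attribute values to contain both single and double quotes, the
apostrophe or single-quote character (') MAY be represented as "&apos;", and the double-quote character (") as "&quot;".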
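
The escaping rules in this section can be illustrated with a minimal Python sketch. The helper names escape_text and escape_attr are hypothetical and are not defined by this specification. The sketch assumes the input contains only characters that are legal in XML, and it escapes every right angle bracket and every quote character, which is stricter than the rules require but always safe. The final assertions use Python's standard xml.etree.ElementTree parser to check that the references are turned back into the original characters.

```python
import xml.etree.ElementTree as ET


def escape_text(s: str) -> str:
    """Escape character data for use in element content (hypothetical helper)."""
    # '&' must be replaced first, so that the ampersands introduced by the
    # later replacements are not escaped a second time.
    s = s.replace("&", "&amp;")
    s = s.replace("<", "&lt;")
    # Escaping every '>' goes beyond the rule, but it guarantees that the
    # string "]]>" never appears literally in content.
    s = s.replace(">", "&gt;")
    return s


def escape_attr(s: str) -> str:
    """Escape a value for use inside a quoted attribute value (hypothetical helper)."""
    s = escape_text(s)
    # Escaping both quote characters lets the value sit inside either
    # single-quoted or double-quoted attribute syntax.
    s = s.replace('"', "&quot;")
    s = s.replace("'", "&apos;")
    return s


text = "a < b && c ]]> d"
attr = "say \"hi\" & 'bye'"
doc = f'<e a="{escape_attr(attr)}">{escape_text(text)}</e>'
print(doc)

# Parsing the document recovers the original, unescaped strings.
root = ET.fromstring(doc)
assert root.text == text
assert root.get("a") == attr
```

For production use, Python's standard library already offers xml.sax.saxutils.escape and xml.sax.saxutils.quoteattr, which cover the same cases.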