
  • From: noah_mendelsohn@u...
  • To: "Costello, Roger L." <costello@m...>
  • Date: Wed, 5 Sep 2007 11:29:46 -0400

Roger Costello writes:

> Can you think of other problems that may result from copying text from
> one document and pasting it into an XML document?

There might be characters that are not legal in XML at all.  See [1], which 
says:

"Well-formedness constraint: Legal Character

Characters referred to using character references MUST match the 
production for Char."

In XML 1.0 that's:

"Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] |
          [#xE000-#xFFFD] | [#x10000-#x10FFFF]
          /* any Unicode character, excluding the surrogate
             blocks, FFFE, and FFFF. */"

If you have any characters that don't fit this production, XML can't 
represent them, escaped or not.  XML 1.1 allows somewhat more, as I 
recall.
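
As a rough illustration of the XML 1.0 production quoted above, pasted text 
could be screened before it goes into a document.  This is only a sketch in 
Python: the function names are made up for the example and do not come from 
the spec or from any library.

```python
def is_xml10_char(ch):
    """Return True if the single character ch matches the XML 1.0 Char production."""
    cp = ord(ch)
    return (
        cp in (0x9, 0xA, 0xD)            # tab, line feed, carriage return
        or 0x20 <= cp <= 0xD7FF          # up to, but excluding, the surrogate blocks
        or 0xE000 <= cp <= 0xFFFD        # excludes FFFE and FFFF
        or 0x10000 <= cp <= 0x10FFFF     # supplementary planes
    )


def find_illegal_xml10_chars(text):
    """Return (index, code point) pairs for characters outside the Char production."""
    return [(i, ord(ch)) for i, ch in enumerate(text) if not is_xml10_char(ch)]


if __name__ == "__main__":
    # Hypothetical pasted text containing a vertical tab (U+000B).
    pasted = "price\x0blist"
    for index, code_point in find_illegal_xml10_chars(pasted):
        print(f"offset {index}: U+{code_point:04X} is not legal in XML 1.0")
```

Anything this reports has to be removed or replaced before serialization, 
since writing it as a character reference is not well-formed either.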

Noah

[1] http://www.w3.org/TR/REC-xml/#wf-Legalchar

--------------------------------------
Noah Mendelsohn 
IBM Corporation
One Rogers Street
Cambridge, MA 02142
1-617-693-4036
--------------------------------------





