Table of contentsAppendices |
3.6 StringsStringsStrings consist of a sequence of zero or more characters, where a character is defined as in the XML Recommendation XML. A single character in XPath thus corresponds to a single Unicode abstract character with a single corresponding Unicode scalar value (see UNICODE); this is not the same thing as a 16-bit Unicode code value: the Unicode coded character representation for an abstract character with Unicode scalar value greater that U+FFFF is a pair of 16-bit Unicode code values (a surrogate pair). In many programming languages, a string is represented by a sequence of 16-bit Unicode code values; implementations of XPath in such languages must take care to ensure that a surrogate pair is correctly treated as a single XPath character. NOTE: |