Re: [Summary] Why is Encoding Metadata (e.g. encoding="UTF-8")

Cart

XML Editor - Download a Free Trial >

See What's New >

Buy Now >

[Home] [By Thread] [By Date] [Recent Entries]

From: richard@i... (Richard Tobin)
To: xml-dev@l...
Date: Fri, 21 Sep 2007 12:43:58 +0100 (BST)

In article <46F3929F.3020903@s...> you write:
>The tip for parsers is that when a strategy (BOM and others as specified 
>in the spec., or some other hazardous heuristic) lead to <?xml 
>encoding="XXX"?>, then XXX can be applied safely

I suppose it would be possible to construct an encoding whose name was
such that when misinterpreted as ascii it appeared to be "utf-8".
For example, suppose it was called "qyz-8", and was identical to
utf-8 except that u and q, t and y, and z and f were exchanged.
Then the qyz-8 string

  <?xml version="1.0" encoding="qyz-8"?>

would be misinterpreted as

  <?xml version="1.0" encoding="utf-8"?>

when read as ascii or utf-8.

-- Richard
-- 
"Consideration shall be given to the need for as many as 32 characters
in some alphabets" - X3.4, 1963.

References:
- Re: [Summary] Why is Encoding Metadata (e.g. encoding="UTF-8")putInside the XML Document?
  - From: Philippe Poulard <philippe.poulard@s...>

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]

XML Editor - Download a 15 Day Free Trial Now >

See What's New in Stylus Studio >

Buy Stylus Studio - XML Editor - Now >