RE: SAXON and UTF-8
Michael Kay writes:
> > Windows Notepad saves UTF8 files with Byte Order Mark, and AFAIK, the XML
> > parser in Saxon (AElfred) doesn't support this (at least it didn't last
> > time I checked).
> >
>
> The question is, can an XML document (or entity) in UTF-8 encoding start
> with a BOM? The fact that Unicode allows it, and the fact that Notepad can
> create it, doesn't make it legal XML.
>
> My reading of the XML spec is that it expects to find BOM only in UTF-16
> files. I can't see any total prohibition of a BOM in a UTF-8 file, but the
> spec certainly seems to assume that they won't occur. If anyone thinks
> otherwise, I'd like to see evidence from the XML specification, which is the
> only definitive source.
>
> This is of course totally off-topic for XSLT.
At the risk of straying further off topic ...
It's my understanding that UTF-8 is an 8-bit encoding in which there
are certain "prefix" octets which control the meaning of some number
of subsequent octets.
Does it make any sense for an 8-bit encoding to have a byte order
mark? It is, after all, already an ordered stream of bytes.
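To make the question concrete, here is a minimal Python sketch (not taken from Saxon or AElfred; the function names sequence_length and strip_utf8_bom are made up for illustration). It shows how a prefix octet fixes the length of a UTF-8 sequence, and that the character U+FEFF, when written in UTF-8, always comes out as the same three octets EF BB BF. In UTF-8 those octets therefore say nothing about byte order; they can only act as a signature at the start of a file.

```python
import codecs

# The UTF-8 encoding of U+FEFF, as defined by the standard library.
UTF8_BOM = codecs.BOM_UTF8  # b'\xef\xbb\xbf'


def sequence_length(lead: int) -> int:
    """Return how many octets the UTF-8 sequence starting with `lead` occupies."""
    if lead < 0x80:
        return 1  # plain ASCII, no following octets
    if 0xC2 <= lead <= 0xDF:
        return 2  # one continuation octet follows
    if 0xE0 <= lead <= 0xEF:
        return 3  # two continuation octets follow
    if 0xF0 <= lead <= 0xF4:
        return 4  # three continuation octets follow
    raise ValueError(f"0x{lead:02X} cannot start a UTF-8 sequence")


def strip_utf8_bom(data: bytes) -> bytes:
    """Hypothetical helper: drop a leading EF BB BF, if present, before parsing."""
    if data.startswith(UTF8_BOM):
        return data[len(UTF8_BOM):]
    return data
```

A caller that wants to tolerate Notepad-style files could pass the raw bytes through strip_utf8_bom before handing them to an XML parser. Whether the parser itself ought to accept the signature is the separate question raised above.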
Since this is unrelated to XSLT, please reply to me directly at
naha@xxxxxxxxxx
XSL-List info and archive: http://www.mulberrytech.com/xsl/xsl-list