[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: XML and mainframes, yet again (was RE: So


hex 85
At 8:49 PM -0500 12/16/01, John Cowan wrote:


>EBCDIC vs. ASCII is perfectly irrelevant to this discussion: mainframes
>can work with ASCII files as well as EBCDIC files, but in either
>case the NEL character (encoded as hex 85 in ASCII encoding or
>hex 15 in EBCDIC encoding) is the native line delimiter.
>

Again, that's simply not true. There is no NEL in ASCII. A document 
that includes NEL (or any other byte above #7F) is not an ASCII 
document.

>>  XML should work with the standard semantics for each character. The
>>  standard understanding of NEL is (in rough order of actual usage):
>>
>>  *  The three-dot ellipsis
>>  *  A missing glyph box
>>  *  Latin capital letter O with diaresis
>>  *  Many other characters
>
>Not at all.  The *character* #x85 means either a line break or nothing at
>all.  The *hex byte* 85 has the multiple meanings you mention, because
>it encodes the character #x2026 in your first case, and the character
>#D6 in your third case.
>

Your understanding of meaning is as defined by the spec. My 
understanding of meaning is what actually gets presented to an end 
user by real software. Present  character 85 to current non-mainframe 
software however you like. Short of turning it into the byte #0A or 
the byte #0D, it's not going to have the meaning "line break".

>In order to understand the issues, it's *critical* not to mix up
>characters and bytes.
>

I'm fully aware of the difference. And yet our software does mix up 
characters and bytes. Even when presented with a document with a 
properly labeled encoding, lots of common software still makes 
mistakes like this.
-- 

+-----------------------+------------------------+-------------------+
| Elliotte Rusty Harold | elharo@m... | Writer/Programmer |
+-----------------------+------------------------+-------------------+
|          The XML Bible, 2nd Edition (Hungry Minds, 2001)           |
|              http://www.ibiblio.org/xml/books/bible2/              |
|   http://www.amazon.com/exec/obidos/ISBN=0764547607/cafeaulaitA/   |
+----------------------------------+---------------------------------+
|  Read Cafe au Lait for Java News:  http://www.cafeaulait.org/      |
|  Read Cafe con Leche for XML News: http://www.ibiblio.org/xml/     |
+----------------------------------+---------------------------------+

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.