[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: XML Blueberry

  • From: Elliotte Rusty Harold <elharo@m...>
  • To: David Carlisle <davidc@n...>, xml-dev@l...
  • Date: Fri, 22 Jun 2001 09:05:02 -0400

java linefeed
At 10:17 AM +0100 6/22/01, David Carlisle wrote:

>Do any files really use NEL that are encoded in utf-8 or utf-16 (or
>utf-8 subsets like ascii that don't need to be declared)?
>
>If all the files using NEL start
><?xml version="1.0" encoding="some-flavour-of-ebcdic"?>
>Then can't NEL be mapped to #10 (0r #13) in the non normative support
>for the ebcdic related encodings. This wouldn't require any change to XML.
>

This is a good idea. Maybe we can fix this part of the problem in the 
context of XML 1.0 without changing the spec. We'd need to define a 
new encoding of Unicode such as IBD-8. IBD-8 would be identical to 
UTF-8 except that normal UTF-8 representation of the NEL character 
would be mapped to the linefeed. Parsers would have the option to 
support or not support IBD-8 at their option, just like today they 
have the option whether or not to support all of IBM's various EBCDIC 
encodings.

XML aware tools would not need to be changed at all, especially if 
they don't want to support the new encoding. XML aware tools that did 
support the IBD-8 encoding would treat it like any other XML 
document. Non-XML-aware tools on IBM mainframes (e.g. text editors, 
println()-like methods in programming languages, etc.) would be able 
to work with the files in a natural native way. Non-IBD-8 aware text 
tools on other platforms would probably choke, but they do that 
anyway today when faced with strange encodings. On the other hand, 
UTF-8 savvy, non-XML-aware tools could still process these documents 
as they usually do.

And of course if UTF-8 isn't the variant that IBM wants, they can 
have IBD-16 (UTF-16), IBD4 (UCS4) etc. The encodings would be 
identical except that XML-aware tools would either translate the NEL 
characters to linefeeds or throw an error because they don't 
recognize the encoding. I think this might make everyone happy. Does 
anyone see a problem with this?



-- 

+-----------------------+------------------------+-------------------+
| Elliotte Rusty Harold | elharo@m... | Writer/Programmer |
+-----------------------+------------------------+-------------------+
|                  The XML Bible (IDG Books, 1999)                   |
|              http://metalab.unc.edu/xml/books/bible/               |
|   http://www.amazon.com/exec/obidos/ISBN=0764532367/cafeaulaitA/   |
+----------------------------------+---------------------------------+
|  Read Cafe au Lait for Java News:  http://metalab.unc.edu/javafaq/ |
|  Read Cafe con Leche for XML News: http://metalab.unc.edu/xml/     |
+----------------------------------+---------------------------------+

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.