[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

RE: UTF-8+names


non break space

Bob Foster wrote:
> 
> Asking for requirements is always a good idea.
> 
> I think users want not to lose the following when they use non-DTD
> validation:
> 
> - Internal entities for common well-known entity sets, like 
> those of XHTML, MathML, etc.
> - Internal entities for user-defined shorthand
> - External parsed entities (includes)
> 
> If internal entities with simple definitions (i.e., no use of 
> parameter
> entities) were the sole requirement and users were willing to 
> have all the entities used in a document defined in the 
> internal subset of every document, then you are right. An 
> editor could easily insert those definitions.


This is one solution that is fine if you want to represent the "difficult"
characters as entity references within the XML document and are willing to
include a number of internal entities in the document.

The other solution I was hinting at is for tools to implement aids for
input/display of difficult characters, limited to the user interface, with
the actual Unicode character being included in the document (i.e., not as an
entity reference).

This would allow, for example, display of an existing XML document
containing difficult characters, by showing the names of those characters to
the user.  (The names can either be shown within an auxiliary view such as
tooltips, or shown in-line.)   No prior transcoding to another encoding
(such as UTF-8+names) would be necessary for displaying an existing document
in this way.

(How does the UTF-8+names proposal address this particular use case?  Prior
conversion to UTF-8+names, perhaps?  And how would one choose among the
multiple encodings available for different characters?  U with diaeresis or
ü?  Greek capital letter Psi or ? ?  and so on.  Should the bytes of the
resulting UTF-8+names -encoded XML document depend, for example, on the
particular linguistic preferences of the user that has touched the document
last?)

Alessandro



> 
> Bob Foster
> 
> From: "Alessandro Triglia" <sandro@m...>
> To: "'Bob Foster'" <bob@o...>; "'Tim Bray'" 
> <tbray@t...>; "'Miles Sabin'" <miles@m...>
> Cc: <xml-dev@l...>
> Sent: Sunday, October 19, 2003 6:17 PM
> Subject: RE:  UTF-8+names
> 
> 
> 
> I have another comment.
> 
> What is that those users have actually been asking for?  What 
> is their actual need?  Do they want to be able to display 
> and/or enter a rare character when using a user interface 
> that doesn't support that character directly?
> 
> If so, isn't this entirely a software issue?  Can't existing 
> XML browsers and editors just be extended so as to support 
> *names* for characters, and we leave the encodings alone?
> 
> For example, if I want to enter a ?   (the cyrillic character) using a
> keyboard that does not support cyrillic, I can currently use 
> some OS-specific means (say, the character map applet in 
> Windows plus a
> copy/paste).    If an XML editor had the inherent ability to 
> accept any
> Unicode character by opening a dialog box showing a list of 
> Unicode names, that would be sufficient for many purposes.  
> Likewise, if an XML viewer had the ability to display the 
> Unicode name of a rare Unicode character when the cursor is 
> above a character, that could be sufficient for many 
> purposes. If some program needs to cope with display hardware 
> that doesn't know how to display a  ? , the software itself 
> can be written so as to show the Unicode name of the ?  
> (CYRILLIC CAPITAL LETTER SHCHA) or some shorter local 
> designation, instead of a small square.
> 
> Recalling one of the cases mentioned, can't an XML editor (as a
> product-specific feature) allow the user to enter  something 
> like   &nbsp;
> and change it on the fly to the Unicode character   NON-BREAK SPACE
> depending on the context?  Can't this XML editor subsequently 
> display a tool
> tip over the character?   Do we really need to  *encode*  the 
> NON-BREAK
> SPACE  as a byte sequence  & n b s p ;   ?
> 
> What fraction of those use cases would be left out, if the 
> issue were regarded as a software issue?
> 
> Alessandro
> 
> 
> 
> -----------------------------------------------------------------
> The xml-dev list is sponsored by XML.org 
> <http://www.xml.org>, an initiative of OASIS 
<http://www.oasis-open.org>

The list archives are at http://lists.xml.org/archives/xml-dev/

To subscribe or unsubscribe from this list use the subscription
manager: <http://lists.xml.org/ob/adm.pl>



PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.