
Re: Blueberry/Unicode/XML

  • From: John Cowan <cowan@m...>
  • To: Jonathan Borden <jborden@m...>
  • Date: Wed, 11 Jul 2001 23:00:37 -0400 (EDT)

Jonathan Borden scripsit:

> Aside from perhaps arbitrary (perhaps not :-) decisions about what characters
> ought or ought not be used to name things, what are these "good reasons"?
> 
> I specifically include in "good reasons":
> 
> 1) useful pieces of code that would break
> 2) hindrances to the development of useful pieces of code

The main point is that it wouldn't be plain text any more.  If XML is just a
binary format, something that no human being ever looks at, then
ASCII markup is plenty: you can tag everything x1, x2, x3, ....

But there are many Unicode characters that are very similar to others,
such as the halfwidth-fullwidth case that's been talked about already,
or the 127 (:-)) kinds of stars, or the various kinds of whitespace
that aren't, and so on.
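
To make the halfwidth-fullwidth case concrete, here is a minimal Python sketch.
It assumes only the standard unicodedata module; the sample names and the
helper fold_compat are hypothetical, chosen for illustration, and are not
taken from any XML specification.

```python
import unicodedata

def fold_compat(name: str) -> str:
    """Hypothetical helper: apply NFKC compatibility normalization to a name."""
    return unicodedata.normalize("NFKC", name)

ascii_name = "ABC"
fullwidth_name = "\uFF21\uFF22\uFF23"  # FULLWIDTH LATIN CAPITAL LETTER A, B, C

# The two names look alike on screen but are different code point sequences.
print(ascii_name == fullwidth_name)               # False
# After compatibility normalization they compare equal.
print(fold_compat(fullwidth_name) == ascii_name)  # True
```

A processor that compares names code point by code point, without any such
folding, would treat these as two unrelated names.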

> I am not limiting the list to these two, but I would like to develop a
> practical way of deciding these very important issues. Clearly any way this
> is decided, tradeoffs are to be made, and I want to give strong weight to
> practical consequences -- just to be clear, I place a high value on the
> ability of humans to read XML, including its markup.

Limiting names to linguistic representations, and univocal ones,
makes it much less likely that they'll be mixed up with one another,
leading to confusion or even fraud.

> But honestly I am hardly a Unicode expert; it's just that my perhaps naive
> impression is that whatever nasty confusing problems might occur
> using weird Unicode characters in names could as easily be replicated using
> nasty confusing -- yet well-formed -- names in XML as it stands today.
> Please educate me otherwise (i.e. this is just my impression).

The existing situation *can* be problematic: a capital alpha can be
substituted for a capital A, indistinguishably to a human being,
for instance.  That is annoying.  Allowing in the non-alphanumeric
characters can only make it worse.
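
The capital alpha case can be checked with a short Python sketch. It assumes
only the standard unicodedata module, and the variable names are illustrative.
It shows that the two letters are distinct characters and that, unlike a
fullwidth letter, the Greek one is not folded to the Latin one by
compatibility normalization.

```python
import unicodedata

latin_a = "\u0041"  # LATIN CAPITAL LETTER A
greek_a = "\u0391"  # GREEK CAPITAL LETTER ALPHA

# Print the code point and official character name of each letter.
for ch in (latin_a, greek_a):
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)}")

# Distinct characters, so names built from them are distinct names.
print(latin_a == greek_a)                                 # False
# NFKC leaves the Greek letter alone; it is not a compatibility variant of A.
print(unicodedata.normalize("NFKC", greek_a) == latin_a)  # False
```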

-- 
John Cowan                                   cowan@c...
One art/there is/no less/no more/All things/to do/with sparks/galore
	--Douglas Hofstadter
