[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Binary versus Text

  • From: Steve Newcomb <srn@coolheads.com>
  • To: xml-dev@lists.xml.org
  • Date: Sun, 24 Nov 2013 10:39:51 -0500

Re:  Binary versus Text
As a practical matter, is there *any* difference between text and binary
data *other* than the necessity of worrying about record-end handling in
the former, but not in the latter?

The problem with John's definition is that it begs the question, "What
does 'interpretation as a stream of characters' mean?"  For many years,
the Wall Street Journal's editorial policy was to describe data sizes as
numbers of characters or, for large numbers, Encyclopedia Brittanicas,
irrespective of whether the data were in fact characters.  The policy
was justified, not only because of the erstwhile naivete of their reader
base, but also, I would argue, because the distinction between
interpretability-as-characters and the lack thereof is not clear.
Regular expressions, for example, are quite useful for detecting
patterns in purely numeric data streams.  Does that make such streams
*text*?

Charles Goldfarb used to say, "If there are bugs in a text-processing
system, at least one of them will have to do with record-end handling."
That rule-of-thumb is perhaps dated now, but the absurd
human-productivity-diminishing differences between Unix, Microsoft, and
Apple record-end conventions are still very much with us.  It's sort of
like  the railroad gauges that did (or did not) descend from the
distances between pairs of ancient chariot wheels; they are the echoes
of empires.  (AFAIK, you still have to change trains, or at least
undercarriages, when crossing into Ukraine.)



On 11/24/2013 09:25 AM, Costello, Roger L. wrote:
> Hi Folks,
> 
>  
> 
> Distinguishing "text" versus "binary" is important.
> 
>  
> 
> On October 30 we had a discussion titled, "Is the binary file format dead?"
> 
>  
> 
> During that discussion John Cowan made an excellent distinction between
> binary and text files. I thought it would be useful to summarize the
> distinction.
> 
>  
> 
> The universe of computer files falls into two categories:
> 
>  
> 
> 1. Binary files
> 
> 2. Text files
> 
>  
> 
> By convention we normally restrict "binary" to files which are not
> interpretable as streams of characters. [John Cowan]
> 
>  
> 
> The word "text" is applied to files which are interpretable as streams
> of characters.
> 
>  
> 
> Of course any text file is also a binary file, since the class of text
> files is obtained from the class of binary files by applying
> restrictions. But it would be confusing to call a text file a binary
> file; it would be like calling a cat a mammal: correct but imprecise.
> 
>  
> 
>  
> 
> /Roger
> 


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.