[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

RE: The impact of data format selection on applicationdevelopm

  • From: Roger L Costello <costello@mitre.org>
  • To: "xml-dev@lists.xml.org" <xml-dev@lists.xml.org>
  • Date: Tue, 12 Jul 2022 11:51:12 +0000

RE:  The impact of data format selection on applicationdevelopm
Hi Folks,

I have been having a discussion with a colleague on this topic. I thought it would be useful to share part of our discussion. 

[To remind people what the topic is about, we are trying to come up with rules/heuristics on when to use super-simple data formats versus when to use comparatively complicated data formats such as XML or JSON.]

My colleague argues that a JSON document containing a list of numbers or an XML document containing a list of numbers is superior to a plain text file containing a list of numbers because the JSON specification sets limits on the numbers and similarly if there is an XSD associated with the XML document it will set limits on the numbers, whereas there are no limits on the numbers in a plain text file. I think there is a problem with that argument. Suppose I create a document, using a super-simple data format, to express the length of rivers in the U.S., e.g.,

Missouri River	2,341
Mississippi River	2,340
Yukon River	1,979
Rio Grande	1,759

When I provide that data (file) to someone I will inform them:

Hey, the file consists of the lengths of rivers in the U.S. Each line of the file contains two fields: the U.S. name of a river and its length. The fields are separated by a tab. The length is expressed in miles as an integer and groups of digits are separated by the comma symbol (such as 1,759).

Without that explanation, the file (data) is useless. But that holds true for an XML file containing the same data and a JSON file containing the same data. One might argue that with XML the tags describe the data, so an accompanying explanation is not needed. But relying on XML tags to explain data is folly (e.g., what if the developer uses generic tags such as <li>, such tags hardly "explain" the data). I would argue, regardless of the data format, there needs to be some accompanying explanation about the data. And if that's the case, then heck, use the simplest possible data format (use the super-simple data format shown above) and take advantage of the plethora of tools available for processing super-simple data formats.

Comments?

/Roger



[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.