[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: 3 XML Design Principles

  • To: "Roger L. Costello" <costello@m...>
  • Subject: Re: 3 XML Design Principles
  • From: Kurt Cagle <kurt.cagle@g...>
  • Date: Sat, 29 Jan 2005 22:30:42 -0800
  • Cc: XML Developers List <xml-dev@l...>
  • Domainkey-signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:reply-to:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:references; b=JBNO2dJxO3/pjTurKYN1kZf11wa/0TSnZ28ZS2Ureccn99Kcr57W/pw7eFvbsoii8GzgiGWM9vtfru87hdUx4qc7ICwUjVQaKs/BVLQHy3G8WJKyg2JXqvBeAPboN/LC6dQb/29dAYFBrmN1UvWVjhvEwWi7sh2rqer4f+gjMgY=
  • In-reply-to: <200501291720.j0THK3517843@s...>
  • References: <200501291720.j0THK3517843@s...>
  • Reply-to: Kurt Cagle <kurt.cagle@g...>

cobb s 12 rules
Roger,

I think to a certain extent that you're applying Ted Cobb's "12 rules
of database design" to XML - in essence, what you've done here is
given in XML a number of key normalization rules. The challenge that I
find when attempting to do is the fact that such normalization can
only occur in situations where there is what I term a low complexity
to the schema (I address this in a distinctly un-user-friendly paper
called <a href="http://www.understandingxml.com/archives/2005/01/information_los.html">Information
Loss and Schema Complexity</a>, which represents some thoughts I've
had on gaining a handle on the potentially complex nature of XML. It's
pretty heavy reading, though I admit that I backed away from going to
a more formal mathematical notation when I realized just how
intimidating it looked).

Keep in mind that in areas where you have complex documents (ones in
which there are a large number of potential states for a given
sequence of nodes in the document tree), normalization becomes much
more difficult to maintain. Similarly, the more that you normalize
content, the higher the amount of hierarchical restructuring becomes
necessary when transforming that content into other representations
(especially in terms of filters where you have multiple distinct flat
structures that may have deep referential relationships).

This is not to say that the rules that you're expressing aren't
important - in general, you are exactly correct in saying that the
best form that a schema can take (what I refer to as a canonical form
in the same paper)  is one where fields are effectively decoupled as
much as possible, and where relationships are explicit. In a simple
schema as you propose above, this canonicalization is obvious, but in
a high complexity schema (such as a DocBook document) such
canonicalization would be both expensive and largely irrelevent, as it
is precisely the relationships themselves - the container/contained
relationships in particular, that ARE the critical parts of the
document.

-- Kurt Cagle
-- UnderstandingXML



 


-- 
Kurt Cagle
http://www.understandingxml.com

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Cast Your Vote

We need your help – Vote for DataDirect XML Products!

  • Best SOA or XML site

Winners and finalists announced at SOA World Conference in November.

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2007 All Rights Reserved.