[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: XML Vocabularies for Large Systems - 3 Philosophically Dif

  • To: "Roger L. Costello" <costello@m...>
  • Subject: Re: XML Vocabularies for Large Systems - 3 Philosophically Different Approaches
  • From: Peter Hunsberger <peter.hunsberger@g...>
  • Date: Mon, 13 Dec 2004 14:44:22 -0600
  • Cc: xml-dev@l...
  • Domainkey-signature: a=rsa-sha1; q=dns; c=nofws; s=beta; d=gmail.com; h=received:message-id:date:from:reply-to:to:subject:cc:in-reply-to:mime-version:content-type:content-transfer-encoding:references; b=Tu1aykL5fKIlS83bT1GqX06PFjxUhczfj5zCJopj+1OkvfA4bB//c3o7lkKE1ebv0w4CogG7mugls8o6lmvYsDb5ydud0WgllBAm5W1auNU+Vw+/qWw4KWsCoqiwHyYsHLQq2AQZyo1phPVJZxebQevYzjsokKmb6JivQfP3r1Q=
  • In-reply-to: <200412121953.iBCJrv912122@s...>
  • References: <200412121953.iBCJrv912122@s...>
  • Reply-to: Peter Hunsberger <peter.hunsberger@g...>

radiation vocabulary
On Sun, 12 Dec 2004 14:53:49 -0500, Roger L. Costello
<costello@m...> wrote:

<snip>introduction</snip>

> 
> ISSUE - NATURE OF XML VOCABULARIES FOR LARGE SYSTEMS
> 
> I identify three philosophically different approaches to the creation of
> an XML vocabulary for a large system:
> 
>   a. Create multiple, simple XML vocabularies.
>   b. Create a single, simple XML vocabulary that is used in multiple ways.
>   c. Create a single, large, complex XML vocabulary.

<snip>discussion</snip>
> 
> QUESTIONS
> 
> Have you implemented a large system?  Have you created an XML vocabulary for
> a large system?  Which of the above three approaches did you take? I am
> particularly interested in hearing from people who have used simple XML
> vocabularies [approach (a) or (b)] to achieve all the data complexities
> in a large system.

I guess it depends on how you define large?  We are building out a
single metadata driven system to collect data across multiple clinical
trials and protocols and related areas of interest (eg. tissue sample
tracking, international tumor registries). So far we've built out
about 1600 screens across about 30 different service areas (protocols,
whatever) tracking over 7500 data points across over 450 collections
of data points. (A collection roughly corresponds to a table in a
traditional relational database and there may be several screens for
the same collection of data points, no service area uses all the
collections defined within the system).

Our approach comes down to a cross between a),  b) and c)....  We have
about 6 relatively simple vocabularies that are used in multiple ways.
 However, our metadata vocabulary is an oddball. If you count elements
it is large (on the order of 7500 + 450 definitions).  However all of
these elements are built by extending a master vocabulary that is
essentially 6 or so main definitions and another (relatively static)
metadata table that defines perhaps another 200 primary relationships
within the system (which everything else must extend).  For example,
at the simplest level we have definitions of "collection" and
"object".  A instance of a collection might be a "therapy" which has
well defined relationships to diagnosis and protocol (among other
things).  An instance of a therapy might be "surgery" or "radiation",
containing 15 to 35 objects.  The business analysts deal only with the
high level abstract concepts and the system hides the details except
on the rare occasions we have to generate a schema for external
exchange purposes (in which case the schema is generated from the
metadata specific to the needs at the time).

We also have our own superset of Schematron for validation that
enables a validation editor GUI.  Don't know how Scheamtron fits in
your large/small vocabulary spectrum?

Over all, now that I know what I'm doing, for our system, I lean
towards a) where "multiple" is a number less than 10.  Use
abstraction, but keep the abstractions to terms that are familiar
within the broad domain (eg, screen layout, validation, business
areas).  Approaches like RDF also make sense for gluing relationships
together and a very simple model for relationship description is
something I'm trying to come to grips with: how far to you want to go
in reducing everything to just another relationship before the
metadata looses it's applicability as a domain specific model, and
thus becomes difficult to model with?

-- 
Peter Hunsberger

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.