
RE: Soft Landing

  • From: Carol Ellerbeck <carol@f...>
  • To: 'KenNorth' <KenNorth@e...>,"'Bullard, Claude L (Len)'" <clbullar@i...>
  • Date: Tue, 24 Oct 2000 07:43:25 -0400

Ken,
If you "were king of the world" with the idea you express below, you would
not need "an unlimited budget"...just a modest one, to have experts build
your taxonomy/domain vocabularies.  I say this as a Taxonomist who has been
in the vocabulary trenches with electronic information for years.
Automation is wonderful (and I would say, even essential), but do *NOT* start
with just humans (albeit smart humans); start with humans who have some
expertise, and you will accomplish your goal faster, with fewer people, more
efficiently, and with a more solid foundation to build on.
Long live the king!

C

-----Original Message-----
From: KenNorth [mailto:KenNorth@e...]
Sent: Monday, October 23, 2000 9:44 PM
To: Bullard, Claude L (Len)
Cc: xml-dev@l...
Subject: Re: Soft Landing



> > I always felt that feeding them
> > automagically from services such as full-text
> > indexing and analysis was dicey.  If you
> > use semantic nets to create semantic nets, it
> > is a bit like using an a-bomb to detonate
> > an h-bomb.

If I were king of the world, with unlimited budget and unlimited
cooperation, I'd start with a taxonomy and domain experts. Let them define a
domain vocabulary (again I keep pointing to MeSH for medical literature).

Then, when new literature is published each month, run it through machine
analysis to identify new terms that start popping up in the literature
(e.g., XML a few years ago). Also identify relationships to existing
concepts or terms (similarity searches), and so on.  The domain experts
identify an alert level (e.g., 5 citations) and when a term or concept
exceeds that level, it's included in a monthly update they receive -- new
terms and concepts in the literature. They use that information when
updating domain vocabularies on a quarterly basis.
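
The monthly step described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not a description of how MeSH or any real indexing service works: the function name `flag_new_terms`, its parameters, and the idea that machine analysis has already produced a list of terms per publication are all hypothetical. "Exceeds" is read strictly, so a term needs more than `alert_level` citing publications to be reported.

```python
from collections import Counter


def flag_new_terms(terms_per_publication, vocabulary, alert_level=5):
    """Return {term: citation_count} for terms outside the vocabulary
    that are cited in more than `alert_level` publications.

    terms_per_publication: iterable of term lists, one list per publication
        (assumed to be the output of some upstream machine analysis).
    vocabulary: the expert-defined domain vocabulary.
    """
    known = {term.lower() for term in vocabulary}
    citations = Counter()
    for terms in terms_per_publication:
        # De-duplicate within a publication so each one counts as one citation.
        citations.update({term.lower() for term in terms} - known)
    # Keep only terms whose citation count exceeds the alert level.
    flagged = {term: n for term, n in citations.items() if n > alert_level}
    # Most-cited first, ties broken alphabetically, for a readable report.
    return dict(sorted(flagged.items(), key=lambda kv: (-kv[1], kv[0])))


if __name__ == "__main__":
    this_month = [["XML", "SGML"]] * 6 + [["DTD"]] * 5
    print(flag_new_terms(this_month, vocabulary=["SGML"], alert_level=5))
```

The output of such a step would only be a candidate list; in the scheme described here the domain experts still decide what enters the vocabulary at the quarterly update.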

Using a pre-defined domain vocabulary is probably more efficient than doing
it all automagically using inference engines, machine analysis of schemas,
RDF, parsing and so on.

Look at the portals that migrated to a classification scheme, instead of
being simply keyword container searches.











