[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

RE: Auto-generate a DTD from multiple XML documents?

  • From: "Michael Kay" <mike@s...>
  • To: "'Jon Noring'" <jon@n...>,<xml-dev@l...>
  • Date: Sun, 7 Oct 2007 04:39:27 +0100

RE:  Auto-generate a DTD from multiple XML documents?
The Saxon DTDGenerator only handles one input document, but I've used it for
this scenario in the past, simply by concatenating all the documents within
a dummy root element, generating the DTD, and then discarding the dummy root
element definition. (It doesn't need to be a physical concatenation, you can
set up a document that brings all the others in as external entities).

I produced a new version of the DTDGenerator a couple of weeks ago, as it
happens - the first release for 6 years, fixing a bug that caused it to
slowly eat memory in proportion to the number of distinct attribute values.
After fixing that, I was able to process 20Gb of input in about 25 minutes.
It's available as a free-standing download from the Saxon project on

Michael Kay

> -----Original Message-----
> From: Jon Noring [mailto:jnoring@g...] On Behalf Of Jon Noring
> Sent: 06 October 2007 21:10
> To: xml-dev@l...
> Subject:  Auto-generate a DTD from multiple XML documents?
> Everyone,
> Several of us involved with Distributed Proofreaders and 
> Project Gutenberg are analyzing a number of TEI documents 
> representing PG etexts.
> As part of the analysis, we'd like to find an application 
> which will autogenerate a minimal DTD (or schema) for 
> validating a given set of documents sharing the same root element.
> I have seen several applications which will autogenerate a 
> DTD for a single XML document, but have not yet found one 
> which will generate a DTD representing a set of XML documents.
> Of course, we prefer a "free" application for this one-time 
> job, but if there's a good commercial package, hopefully we 
> can find someone with a copy who'd be happy to volunteer 
> running the job for us.
> Thanks in advance.
> Jon Noring
> ______________________________________________________________
> _________
> XML-DEV is a publicly archived, unmoderated list hosted by 
> OASIS to support XML implementation and development. To 
> minimize spam in the archives, you must subscribe before posting.
> [Un]Subscribe/change address: http://www.oasis-open.org/mlmanage/
> Or unsubscribe: xml-dev-unsubscribe@l...
> subscribe: xml-dev-subscribe@l... List archive: 
> http://lists.xml.org/archives/xml-dev/
> List Guidelines: http://www.oasis-open.org/maillists/guidelines.php

[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
First Name
Last Name
Subscribe in XML format
RSS 2.0
Atom 0.3

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.

Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.