  • From: "Michael Kay" <mike@s...>
  • To: "'Jon Noring'" <jon@n...>,<xml-dev@l...>
  • Date: Sun, 7 Oct 2007 04:39:27 +0100

The Saxon DTDGenerator only handles one input document, but I've used it for
this scenario in the past, simply by concatenating all the documents within
a dummy root element, generating the DTD, and then discarding the dummy root
element definition. (It doesn't need to be a physical concatenation: you can
set up a document that brings all the others in as external entities.)
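
A minimal sketch of the entity-based wrapper, in Python, in case it helps to see
the shape of it. The element name dummy-root, the entity names doc1, doc2, ...
and the build_wrapper helper are all illustrative assumptions, not part of
DTDGenerator. It also assumes none of the input documents carries its own
DOCTYPE declaration, since an external parsed entity cannot contain one.

```python
import sys
from pathlib import Path


def build_wrapper(paths, root="dummy-root"):
    """Return the text of a wrapper document that pulls each input file in
    as an external parsed entity inside a single dummy root element.

    `root` and the generated entity names (doc1, doc2, ...) are hypothetical;
    any names that do not clash with the real vocabulary would do.
    """
    decls = []
    refs = []
    for i, p in enumerate(paths, start=1):
        name = f"doc{i}"
        # as_uri() percent-encodes quotes and ampersands, so the URI is
        # safe to place between double quotes as a system identifier.
        uri = Path(p).resolve().as_uri()
        decls.append(f'  <!ENTITY {name} SYSTEM "{uri}">')
        refs.append(f"  &{name};")
    lines = [
        '<?xml version="1.0"?>',
        f"<!DOCTYPE {root} [",
        *decls,
        "]>",
        f"<{root}>",
        *refs,
        f"</{root}>",
        "",
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    # Usage: python make_wrapper.py a.xml b.xml > wrapper.xml
    sys.stdout.write(build_wrapper(sys.argv[1:]))
```

The wrapper is then the single input document; once the DTD has been generated
from it, the declaration for the dummy root element is the part to throw away.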

I produced a new version of the DTDGenerator a couple of weeks ago, as it
happens - the first release for 6 years, fixing a bug that caused it to
slowly eat memory in proportion to the number of distinct attribute values.
After fixing that, I was able to process 20GB of input in about 25 minutes.
It's available as a free-standing download from the Saxon project on
SourceForge.

Michael Kay
http://www.saxonica.com/ 


> -----Original Message-----
> From: Jon Noring [mailto:jnoring@g...] On Behalf Of Jon Noring
> Sent: 06 October 2007 21:10
> To: xml-dev@l...
> Subject:  Auto-generate a DTD from multiple XML documents?
> 
> Everyone,
> 
> Several of us involved with Distributed Proofreaders and 
> Project Gutenberg are analyzing a number of TEI documents 
> representing PG etexts.
> 
> As part of the analysis, we'd like to find an application 
> which will autogenerate a minimal DTD (or schema) for 
> validating a given set of documents sharing the same root element.
> 
> I have seen several applications which will autogenerate a 
> DTD for a single XML document, but have not yet found one 
> which will generate a DTD representing a set of XML documents.
> 
> Of course, we prefer a "free" application for this one-time 
> job, but if there's a good commercial package, hopefully we 
> can find someone with a copy who'd be happy to volunteer 
> running the job for us.
> 
> Thanks in advance.
> 
> Jon Noring


