|
[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: Gutenberg Project <longish>
I think that if you are starting with texts from Project Gutenberg, the DTD isn't too important. Do read the TEI DTD and use whatever you can from it, but remember also that the PG texts are far from scholarly in the first place. Michael Hart has checked provenance only so far as is necessary to ensure copyright status. The texts there are not always the "most familiar" versions[1], and markup such as italics or other emphasis is either lost or erndered as ALL CAPS. If you use the same structures as the TEI where possible, but names that seem clearer for what you're doing, for example, an automatic translation would be possible. This is actually a project very dear to me -- I had wanted so badly to set up an archive of marked-up texts (then in SGML) when I worked at SoftQuad, but the world wasn't ready for it. The texts at the Oxford Text Archive were not freely redistributable, and there were few other sources at that time. Lee -- Liam Quin, Open Source / SGML / XML / Unix / Perl consultant, Toronto http://www.holoweb.net/~liam/ Ankh on irc.sorcery.net barefoot on the net! co-oauthor, The XML Specification Guide, Wiley Inc. forthcoming: The Open Source XML Database Toolkit... *************************************************************************** This is xml-dev, the mailing list for XML developers. To unsubscribe, mailto:majordomo@x...&BODY=unsubscribe%20xml-dev List archives are available at http://xml.org/archives/xml-dev/ ***************************************************************************
|
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|
|||||||||

Cart








