[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

ANN: Gorille (alpha)

  • To: xml-dev@l...
  • Subject: ANN: Gorille (alpha)
  • From: "Simon St.Laurent" <simonstl@s...>
  • Date: Sat, 22 Dec 2001 09:20:29 -0500

Gorille [1] is a very simple Java library for testing XML content and 
labels against lists of allowable Unicode characters like those provided in 
XML 1.0 [2] and XML 1.1 [2].  Gorille is available under the Mozilla Public 

Gorille uses an XML format to specify lists of characters according to 
either XML 1.0 conventions (with its BaseChar, Ideographic, CombiningChar, 
Digit, and Extender productions) or XML 1.1 conventions (NameStartChar, 
NameChar).  Both forms permit specification of the Char and S production 
for content characters and whitespace.  I've included sample lists for both 
XML 1.0 and XML 1.1, as well as an ASCII-only version of XML 1.0.

Gorille performs checking of Name, Names, NMTOKEN, and NMTOKENS, as well as 
character checking for any of the productions listed above.  This checking 
is performed by XML parsers as documents are parsed, but Gorille may be 
useful for checking XML documents generated by programs or to restrict 
documents to subsets of the characters allowed by XML.  Gorille relies 
completely on Java's built-in support for Unicode strings and characters, 
though it doesn't use any of the Unicode property information Java provides.

Gorille does provide for some rather perverse modifications of the 
productions - you could, for instance, require that all content be in 
control characters while all names be ideographic - but my hope is that 
developers will use it in reasonable ways which don't create arbitrary 
explosions as programs reject bad information.

I'll be using Gorille to provide name- and content-checking for MOE [4], 
but hope to also create a SAXFilter which uses it and perhaps a Java 
FilterReader for preprocessing content before it reaches a parser.

Gorille is currently in alpha. I believe the basic functionality is 
complete, but there's still potential for improvement, expansion, and as 
always, better documentation.  (Including RDDL documents for the character 
list and test files!)

[1] http://simonstl.com/projects/gorille
[2] http://www.w3.org/TR/REC-xml
[3] http://www.w3.org/TR/xml11/
[4] http://moe.sourceforge.net

Simon St.Laurent
Associate Editor, O'Reilly & Associates


Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
First Name
Last Name
Subscribe in XML format
RSS 2.0
Atom 0.3

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.

Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.