[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

RE: XSchema: ease of use (design goal 5)

  • From: "Rick Jelliffe" <ricko@a...>
  • To: "XML Dev" <xml-dev@i...>
  • Date: Wed, 3 Jun 1998 01:47:00 +1000

how to use rxml
> From: Michael Kay

> * one of the limitations of XML is that we cannot constrain
> the content of
> character strings (in attributes or PCDATA) in a DTD
> * the ideal way to define such constraints would be with
> regular expressions or BNF production rules
> * let's call "XML with constrained character strings" Rich
> XML or RXML. Of course an RXML document is an XML document;
> it just has some "content validity" rules in addition to the
> XML well-formedness and validation rules.

Actually, there is already a standard syntax to let you constrain the
content of character strings, using the SGML Lexical Typing Definition
Requirements (LTDR). In standard jargon this is called "lexical typing": the
stated intention of the ISO Working Group maintaining SGML is to move these
into the SGML standard--at the moment they are just parked as part of the
HyperText and Time Scheduling Languages standard (HyTime).

LTDR Lexical typing provides a syntax to let you use any syntax you like to
constain strings. In particular it allows POSIX regular expressions. There
is also a token-matching language, which can look quite like content models.
You can see the standard online at
http://www.ornl.gov/sgml/wg8/docs/n1920/html/clause-A.2.html

<RJ:PLUG>You can also see this explained in my book "The XML & SGML
Cookbook" (just out this week !!!) on page 2-117 "Defining Data Types".
</RJ:Plug>

You can embed your own syntax inside elements using NOTATION attributes, or
using PIs, of course.

> * we can imagine a "pre-parser" which takes an RXML document
> and automatically generates additional XML markup so that
> all the syntax is now fully accessible as elements and
> attributes. This pre-parsed document would no longer be
> easily readable, but it would be easily processable using
> standard XML tools

If you are talking about a pre-parser, then the danger is that you are no
longer using XML. The ability to alias strings and delimiters to (entity
references which deference to) elements, PIs, entity references etc. was
part of SGML (i.e., SHORTREF, DATATAG) that was jettisoned for XML. If you
are talking about the ability to pre-process an entity according to a
(pipeline of) processes, that also is part of SGML Extended Facilities
(i.e., Formal System Identifiers) which was jettisoned by XML when they
decided to use URI syntax (It could be introduced again easily: or the URI
query syntax could of course be used to provide extra parameters too; or a
custom XML document-type could specify a tranformation itself as part of
storage/entity management--but then the data is not XML, and so is outside
the XSchema scope, I think.)

> * We could encode both the current DTD information and the
> additional constraints in RXML
> * if we use RXML rather than plain XML to encode our DTD, we
> can continue to use BNF-like production rules written as
> text, while still being able to process the thing using
> general-purpose XML machinery.

HyTime LDTR went down this track (i.e. a variant, specialized syntax) and I
think it did not work so well.

But I would be happy if the XML markup declarations were given explicit
specifications in terms of XSchema: that would allow a simple preprocessor
to convert existing DTDs into XSchemas.

We have had two constituencies here: one wants to have declarations using
element syntax, the other wants to add new constraints.  I would again
submit that there is a third which we should not ignore: people who might
not want to either change to XML markup declarations or to something like
XData, but want to keep their old schema-definition systems. Anyone who has
played with Adobe's excellent EDD knows how mind-numbing things can get when
you have to support one schema syntax with another almost identical one,
even with sophisticated translators.


Rick Jelliffe

==========================================================
The XML & SGML Cookbook, by Rick Jelliffe
Charles F. Goldfarb Series on Open Information Management
656 pages + CD-ROM, Prentice Hall 1998, ISBN 0-13-614233-0
http://www.sil.org/sgml/jelliffeXMLAnn.html
http://www.phptr.com/  > Book Search > "Jelliffe"
http://www.amazon.com/exec/obidos/ASIN/0136142230/002-4102466-3352420


xml-dev: A list for W3C XML Developers. To post, mailto:xml-dev@i...
Archived as: http://www.lists.ic.ac.uk/hypermail/xml-dev/
To (un)subscribe, mailto:majordomo@i... the following message;
(un)subscribe xml-dev
To subscribe to the digests, mailto:majordomo@i... the following message;
subscribe xml-dev-digest
List coordinator, Henry Rzepa (mailto:rzepa@i...)


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery ™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2013 All Rights Reserved.