Re: Fwd: Not using mixed content? Then don't use XML

From: John Cowan <johnwcowan@g...>
To: Rick Jelliffe <rjelliffe@a...>
Date: Wed, 10 Apr 2013 01:52:45 -0400

On Wed, Apr 10, 2013 at 1:23 AM, Rick Jelliffe <rjelliffe@a...> wrote:

This is a truly excellent post, which you should work up into an article somewhere.

1) Test driven development. Before, as, or so soon after that no one notices, you make some software, you make a test for it. If the document has a fixed structure, you can test by instances. If the document is semi-structured or recursive, your test specification has to allow those kinds of structures too: and for XML such a specification is called a schema.

Examplotron is particularly nice here, or using Trang to generate a post hoc RELAX NG schema from a reasonable library of existing instances. Any new instance that fails output validation is then added to the library, and the schema is regenerated. (Unfortunately, Trang can't accept a schema on the input side when doing this, or you'd just need to keep the schema. I'm working on a tool that will be much cruder than Trang but will have this capability.)
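The regenerate-from-the-library loop can be sketched roughly as follows. This is only an illustration: it assumes a `trang` launcher on the PATH and a directory of `*.xml` instances, and the helper names `trang_command` and `add_to_library` are made up for this sketch. Only the `-I xml` and `-O rng` options and the "inputs first, output last" argument order come from Trang itself.

```python
import shutil
from pathlib import Path


def trang_command(library_dir, schema_path):
    """Build the Trang invocation that infers a RELAX NG schema
    from every XML instance currently in the library directory."""
    instances = sorted(str(p) for p in Path(library_dir).glob("*.xml"))
    # Assumption: a `trang` executable is on the PATH; with a plain jar
    # this would start with ["java", "-jar", "trang.jar", ...] instead.
    return ["trang", "-I", "xml", "-O", "rng", *instances, str(schema_path)]


def add_to_library(instance_path, library_dir):
    """Copy an instance that failed validation into the library,
    so that the next schema regeneration takes it into account."""
    target = Path(library_dir) / Path(instance_path).name
    shutil.copyfile(instance_path, target)
    return target
```

After each call to `add_to_library`, the schema would be rebuilt with something like `subprocess.run(trang_command("library", "schema.rng"), check=True)`. Since Trang has no schema input in this mode, the whole library has to be passed in again every time.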

2) Quality assurance. I work in a company with a globally distributed development and production system (it is so big that US content architects may forget they have brother content architects in other countries when casually posting :-).

Not me, dude. My "two dozen" included our Indian siblings. If there are schema developers elsewhere than (in caps) Content Architecture, I don't know about it.

3) Conway's Law. A successful system must have sub-system boundaries that match the organization. Formalizing a boundary that matches internal organizational boundaries helps reduce communication costs. Formalizing a boundary within a team needs to allow flexibility and agility; otherwise it will get in the way.

Indeed, that's what our internal schemas are basically for. "Boundarylessness" is one of $EMPLOYER's so-called key values, which (as usual) is an indication that they aren't (yet) very good at it.

Where I would disagree with Simon, I think, is that I think the advent of JSON for point-to-point interchange actually means that probably you should always use a schema with XML: if you don't need a schema perhaps you should be using JSON?

The problem with JSON is that arrays provide ordering and objects provide naming, but if you want named ordering you have to go a level deeper, which is annoying. In a JSON document containing a sequence of paragraphs interspersed with blockquotes, you have to make each element of the outermost array a dummy object like {"type" : "paragraph", "content" : (whatever)}. Not all JSON systems correctly handle the case of the top-level item being an array, either.
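A small sketch of that wrapper-object pattern, using Python's standard `json` module. The sample content and the `element_types` helper are invented for illustration. The second half shows what happens in this particular parser if you try to skip the extra level and use repeated member names in a single object.

```python
import json

# Named ordering in JSON: the array supplies the order, and each element
# is a wrapper object whose "type" member stands in for an element name.
document = [
    {"type": "paragraph", "content": "First paragraph."},
    {"type": "blockquote", "content": "A quoted passage."},
    {"type": "paragraph", "content": "Second paragraph."},
]


def element_types(doc):
    """Return the sequence of 'element names' in document order."""
    return [item["type"] for item in doc]


# Round-trip through text; note that the top-level item is an array.
decoded = json.loads(json.dumps(document))

# Without the wrapper level, repeated names collapse: Python's json
# module keeps only the last value seen for a duplicated member name.
flat = json.loads('{"paragraph": "a", "blockquote": "b", "paragraph": "c"}')
```

Here `element_types(decoded)` gives back paragraph, blockquote, paragraph in order, while `flat` ends up with only two members, so the first paragraph is silently lost.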

Actually, that is too much: what trumps often is how easy a format is to fit into your current ecosystem and capabilities:

Sure, local issues almost always trump global architecture in practice, unless there are *very* strong top-down drivers.

--
GMail doesn't have rotating .sigs, but you can see mine at http://www.ccil.org/~cowan/signatures
