RE: for-each-group and result-document splitting to le

Play the video

Subject: RE: for-each-group and result-document splitting to less files.
From: "Michael Kay" <mike@xxxxxxxxxxxx>
Date: Wed, 10 Sep 2008 18:27:31 +0100

> -- though makes me curious... how hard would it be to change 
> this so that the groupings were based on the count() of items 
> in the nodeset?
> i.e. make divisions that were basically equal regardless of the input?
>  So if there were almost no entries in the first half of the 
> alphabet, the first division would be a-r, then s-t (if a lot 
> there), then u-w (if few there), etc.  maybe passing how many 
> sets to split it into total?

If the sorted sequence is $in, and you want to split it roughly into $n
groups, then you can take the size of a group as ceiling(count($in) div $n).
Then the letters that form the starts of groups might have initial letters

$initialLetters := 
distinct-values('a', for $i in 1 to $n return
substring($in[ceiling(count($in) div $n)]/@title, 1, 1))

The next step is to construct the translation table such as
'aaaaaaffffffsssssvvvvv' by replacing each letter in a-z with the highest
letter from $initialLetters that is <= the letter in question, that is:

$alphabet :=
   for $i in 1 to 26 return substring('abcdefghijklmnopqrstuvwxyz', $i, 1)

$transTable :=
string-join(for $c in $alphabet return max($initialLetters[. le $c]), '')
           

Not tested, of course.

Michael Kay
http://www.saxonica.com/

Current Thread

RE: for-each-group and result-document splitting to less files., (continued)
- Michael Kay - 5 Sep 2008 13:19:40 -0000
  - James Cummings - 8 Sep 2008 12:04:04 -0000
    - Message not available
    - Wendell Piez - 8 Sep 2008 22:31:08 -0000
    - Message not available
    - James Cummings - 10 Sep 2008 17:03:27 -0000
    - Michael Kay - 10 Sep 2008 17:28:06 -0000 <=
    - Message not available
    - Wendell Piez - 10 Sep 2008 18:26:00 -0000

<- Previous	Index	Next ->
Re: for-each-group and result, James Cummings	Thread	Re: for-each-group and result, Wendell Piez
Re: for-each-group and result, James Cummings	Date	Re: for-each-group and result, Wendell Piez
	Month

PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Subscribe in XML format

RSS 2.0
Atom 0.3

XML Editor - Download a 15 Day Free Trial Now >

See What's New in Stylus Studio >

Buy Stylus Studio - XML Editor - Now >