[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message] Re: combining XMLEvent lists
On 09/28/2010 04:33 PM, Michael Kay wrote: > Sounds fascinating, and I wish I had time to get involved. It would > certainly be elegant if you could have both the productivity of writing > this declaratively in XSLT and the performance of running it on Hadoop > MapReduce. Intrinsically, the two seem to fit together hand in glove, > but I suspect some engineering effort is needed to make it work. Hello Michael, I think it would be too complicated to achieve the desired grouping with Java. Do you think it makes sense to first serialize the results and then use Saxon's XSLT 2.0 processor to achieve the results? Or do you have any direct input from a List of XMLEvents to Saxon's XSLT processor? I assume it reads XML-data from an InputSource or some kind of a stream. It's a special case, where two or more revisions of one article are made at the same time (in the same second). I would have to look through the XML file with BaseX or Saxon, but I'm pretty sure such cases exist somewhere in the hugh file (as of now I've only extracted a small subset of articles and replaced WikiText inside text-elements with XML). The whole task is to sort the revisions to shredder it into our XML datastorage system (the deltas of the revisions), which has the capability to store and retrieve revisions compactly and efficiently. In parallel I'm currently writing the import of a sorted XML file. My main task (master project and thesis) is or will be the visualization of temporal tree structured data to gain further insights into the evolution of the data, which are otherwise very difficult to realize. regards, Johannes
[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index] |
PURCHASE STYLUS STUDIO ONLINE TODAY!Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced! Download The World's Best XML IDE!Accelerate XML development with our award-winning XML IDE - Download a free trial today! Subscribe in XML format
|