XML Editor
Sign up for a WebBoard account Sign Up Keyword Search Search More Options... Options
Chat Rooms Chat Help Help News News Log in to WebBoard Log in Not Logged in
Show tree view Topic
Topic Page 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Go to previous topicPrev TopicGo to next topicNext Topic
Postnext
Richard EggintonSubject: Line Delimitted File Conversion
Author: Richard Egginton
Date: 05 Jul 2013 03:54 AM
Hi All

I have a bit of a strange problem here and I hope someone can help me.

We have been using Stylus Studio for a number of years for converting various flat file formats into XML, but recently have come up against a new format that I have no idea how to deal with.

The basic file structure is as follows:

header_field_1
header_field_2
<empty row>
record_1_field_1
record_1_field_2
record_1_field_3
record_1_field_4
record_1_field_5
<empty row>
record_1_field_6
record_1_field_7
record_1_field_8
<empty row>
<empty row>
record_2_field_1
record_2_field_2
record_2_field_3
record_2_field_4
record_2_field_5
<empty row>
record_2_field_6
record_2_field_7
record_2_field_8
<empty row>
<empty row>

... and so on. So, basically the file is delimitted by CR/LF and each record contains 8 fields.

What I want to do is use Stylus Studio to create an XML Converter to transpose this into an XML format like:

<file>
<record_1>
<field_1></field_1>
<field_2></field_2>
<field_3></field_3>
<field_4></field_4>
<field_5></field_5>
<field_6></field_6>
<field_7></field_7>
<field_8></field_8>
</record_1>
<record_2>
<field_1></field_1>
<field_2></field_2>
<field_3></field_3>
<field_4></field_4>
<field_5></field_5>
<field_6></field_6>
<field_7></field_7>
<field_8></field_8>
</record_2>
</file>

I've tried various things without much success and am really looking for some pointers on how I might be able to achieve this.

Thanks in advance.

Richard

Postnext
Ivan PedruzziSubject: Line Delimitted File Conversion
Author: Ivan Pedruzzi
Date: 05 Jul 2013 12:43 PM

Hi Richard,

Is there any prefix in the row or it's the raw field data?
Is the header only present once in the first 2 lines of the file?
How large are these files in Kilobytes?
Is the double empty line the record separator?
Do you need to run the conversion as batch outside Stylus Studio?


Ivan Pedruzzi
Stylus Studio Team

Postnext
Richard EggintonSubject: Line Delimitted File Conversion
Author: Richard Egginton
Date: 11 Jul 2013 03:29 AM
Originally Posted: 11 Jul 2013 03:28 AM
Hi Ivan

Many thanks for your reply.

>Is there any prefix in the row or it's the raw field data?
Basically, it's the raw data. The first field is a row Id, so we can say that that would always be an integer value.

>Is the header only present once in the first 2 lines of the file?
Yes, the header is only present once at the top of the file

>How large are these files in Kilobytes?
These are averaging 100-500KB in size

>Is the double empty line the record separator?
We get files of this format from a number of different sources and the double space seems to vary. I guess at best we can say that we know that each record contains 5 fields. The double space is not consistent

>Do you need to run the conversion as batch outside Stylus Studio?
Conventionally, we are using Stylus Studio to define a converter and an XSLT to transpose the resulting XML into the format we need. We have an application that we have writted which uses the API to apply the converter and then the XSLT to files that we receive.

Thanks again

Richard

Posttop
Ivan PedruzziSubject: Line Delimitted File Conversion
Author: Ivan Pedruzzi
Date: 15 Jul 2013 04:05 PM
Hi Richard,

The following solution assumes the following:
- There is only one header (2 lines) at the the beginning of the file.
- The file is not huge.
- All empty lines are filtered out.
- The remaining fields are group by 8 for each record.
- The input document is loaded using the built-in CSV converter
For example converter:CSV:root=root?file:///c:/test/data.txt

Does it help?

Ivan Pedruzzi
Stylus Studio Team


//////////////////////////////////////////////
XSLT

<?xml version="1.0"?>
<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
<xsl:output indent="yes"/>
<xsl:variable name="flatList">
<xsl:for-each select="root/row[position() &gt; 2]/column.0[string-length(.) &gt; 0]">
<xsl:copy-of select="."/>
</xsl:for-each>
</xsl:variable>
<xsl:template match="/">
<root>
<xsl:for-each select="$flatList/*">
<xsl:if test="position() mod 8 = 0">
<xsl:element name="{concat('record_',position() div 8)}">
<field_1><xsl:value-of select="preceding-sibling::*[7]"/></field_1>
<field_2><xsl:value-of select="preceding-sibling::*[6]"/></field_2>
<field_3><xsl:value-of select="preceding-sibling::*[5]"/></field_3>
<field_4><xsl:value-of select="preceding-sibling::*[4]"/></field_4>
<field_5><xsl:value-of select="preceding-sibling::*[3]"/></field_5>
<field_6><xsl:value-of select="preceding-sibling::*[2]"/></field_6>
<field_7><xsl:value-of select="preceding-sibling::*[1]"/></field_7>
<field_8><xsl:value-of select="."/></field_8>
</xsl:element>
</xsl:if>
</xsl:for-each>
</root>
</xsl:template>
</xsl:stylesheet>


//////////////////////////////////////////////
Result

<?xml version='1.0' ?>
<root>
<record_1>
<field_1>record_1_field_1</field_1>
<field_2>record_1_field_2</field_2>
<field_3>record_1_field_3</field_3>
<field_4>record_1_field_4</field_4>
<field_5>record_1_field_5</field_5>
<field_6>record_1_field_6</field_6>
<field_7>record_1_field_7</field_7>
<field_8>record_1_field_8</field_8>
</record_1>
<record_2>
<field_1>record_2_field_1</field_1>
<field_2>record_2_field_2</field_2>
<field_3>record_2_field_3</field_3>
<field_4>record_2_field_4</field_4>
<field_5>record_2_field_5</field_5>
<field_6>record_2_field_6</field_6>
<field_7>record_2_field_7</field_7>
<field_8>record_2_field_8</field_8>
</record_2>
</root>

 
Topic Page 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Go to previous topicPrev TopicGo to next topicNext Topic
Download A Free Trial of Stylus Studio 6 XML Professional Edition Today! Powered by Stylus Studio, the world's leading XML IDE for XML, XSLT, XQuery, XML Schema, DTD, XPath, WSDL, XHTML, SQL/XML, and XML Mapping!  
go

Log In Options

Site Map | Privacy Policy | Terms of Use | Trademarks
Stylus Scoop XML Newsletter:
W3C Member
Stylus Studio® and DataDirect XQuery ™are from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2016 All Rights Reserved.