XML Editor
Sign up for a WebBoard account Sign Up Keyword Search Search More Options... Options
Chat Rooms Chat Help Help News News Log in to WebBoard Log in Not Logged in
Show tree view Topic
Topic Page 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Go to previous topicPrev TopicGo to next topicNext Topic
Postnext
Doug LundinSubject: Is a default adapter - not Tidy - being used here?
Author: Doug Lundin
Date: 04 Apr 2007 11:53 AM
I don't understand what adapter, if any, is being used here. I can convert HTML to XML without using an adapter but when I choose to use Tidy, I receive an error. Does this mean that Stylus has a default functionality - similar to Tidy - that is used if Tidy is not selected? Any assistance is appreciated.

No adapter:
Choose File | Document wizards | XML Editor | HTML to XML

Choose HTML file to convert: http://biz.yahoo.com/e/070404/fbsi8-k.html

Click OK and it is successful

Using Tidy as an adapter:
Choose File | Document wizards | XML Editor | HTML to XML

Choose HTML file to convert: http://biz.yahoo.com/e/070404/fbsi8-k.html

Click ...
Respecify http://biz.yahoo.com/e/070404/fbsi8-k.html
Click Convert to XML using adapter
Click OPEN
Choose HTML-to-XHTML HtmlTidy - leave default properties
Click OK
Click OK

Here is the error reported
Tidy (vers Sep 26, 2004) Parsing "InputStream"
line 73 column 15 - Error: discarding unexpected </form>
line 73 column 30 - Error: discarding unexpected </td>
line 73 column 35 - Error: discarding unexpected </tr>
line 288 column 35 - Error: discarding unexpected <td>
line 308 column 10 - Error: discarding unexpected </form>

Postnext
Doug LundinSubject: Is a default adapter - not Tidy - being used here?
Author: Doug Lundin
Date: 04 Apr 2007 05:46 PM
How can I verify what adapter is being used by the Stylus tool?

Postnext
Tony LavinioSubject: Is a default adapter - not Tidy - being used here?
Author: Tony Lavinio
Date: 04 Apr 2007 08:24 PM
Doing this:
Choose File | Document wizards | XML Editor | HTML to XML
does use HTML Tidy, but with some non-default settings. It
also uses a more sophisticated version of Tidy.

Using the adapter, try changing the errors= property to see
if it will let you open the HTML file. The adapter version
of Tidy is a slightly different version due to the limitations
imposed by having it inserted as a layer between the filesystem
and the editor.

Postnext
Doug LundinSubject: Is a default adapter - not Tidy - being used here?
Author: Doug Lundin
Date: 04 Apr 2007 08:34 PM
Is it best to use the more sophisticated version of Tidy?

I just tried using an adapter and the Stylus tool - here is the URL

adapter:HTMLTidy:newline=crlf:encoding=utf-8:warnings=no:errors=no?http://biz.yahoo.com/e/070404/fbsi8-k.html

and get the error

java.io.IOException: Premature end of file. {AdapterFile.copyToFile}
at com.stylusstudio.adapter.fs.AdapterFile.checkException(AdapterFile.java:402)
at com.stylusstudio.adapter.fs.AdapterFile.copyToFile(AdapterFile.java:485)
Caused by: org.xml.sax.SAXParseException: Premature end of file.
at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(Unknown Source)
at com.stylusstudio.adapter.simple.HTML.toXML(HTML.java:74)
at com.stylusstudio.adapter.AdapterBase.toXML(AdapterBase.java:248)
at com.stylusstudio.adapter.fs.AdapterRunner.run(AdapterRunner.java:56)
at java.lang.Thread.run(Unknown Source)

Postnext
Doug LundinSubject: Is a default adapter - not Tidy - being used here?
Author: Doug Lundin
Date: 04 Apr 2007 08:40 PM
What I'm really trying to do is convert this HTML in my java code - can I call this "Tidy-like" from java?

Postnext
Tony LavinioSubject: Is a default adapter - not Tidy - being used here?
Author: Tony Lavinio
Date: 05 Apr 2007 09:35 AM
In digging through the HTML on Yahoo!, I don't think the Tidy
version we use in the XML Converter will work for you.

Might I suggest looking into TagSoup? It can be used as a
replacement for the parser, and has a more aggressive error-recovery
policy. See http://home.ccil.org/~cowan/XML/tagsoup/

You could do this to use it with Stylus Studio:

1. Convert the Yahoo! page manually, and save it as XML.
2. Build your maps.
3. Deploy, but using TagSoup as the parser.

Hope this helps.

Postnext
Doug LundinSubject: Is a default adapter - not Tidy - being used here?
Author: Doug Lundin
Date: 05 Apr 2007 09:42 AM
I appreciate the option but am wondering if I can leverage the java api and your more sophisticated version of Tidy? This will be deployed as an automated process so converting HTML into XML manually is not an option.

Thanks

Posttop
Tony LavinioSubject: Is a default adapter - not Tidy - being used here?
Author: Tony Lavinio
Date: 06 Apr 2007 12:40 AM
The problem is that the version of Tidy used in the wizard is the
'C' version, and the version used in the Java adapters is JTidy,
which is a subset of the 'C' version.

 
Topic Page 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Go to previous topicPrev TopicGo to next topicNext Topic
Download A Free Trial of Stylus Studio 6 XML Professional Edition Today! Powered by Stylus Studio, the world's leading XML IDE for XML, XSLT, XQuery, XML Schema, DTD, XPath, WSDL, XHTML, SQL/XML, and XML Mapping!  
go

Log In Options

Site Map | Privacy Policy | Terms of Use | Trademarks
Stylus Scoop XML Newsletter:
W3C Member
Stylus Studio® and DataDirect XQuery ™are from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2016 All Rights Reserved.