XML Editor
Sign up for a WebBoard account Sign Up Keyword Search Search More Options... Options
Chat Rooms Chat Help Help News News Log in to WebBoard Log in Not Logged in
Show tree view Topic
Topic Page 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Go to previous topicPrev TopicGo to next topicNext Topic
Doug LundinSubject: Screen scraping using Stylus?
Author: Doug Lundin
Date: 19 Apr 2006 11:13 AM
Advice using Stylus

My goal is to build a data mining application using PR Newswire (http://biz.yahoo.com/prnews/archive.html)

Essentially, I will be doing ***LIMITED*** screen scraping (given a date, etc.)

There are some good tools available such as WebQL but I am wondering if Stylus can do this as well. I would prefer a more general tool if it is available.

I have been told screen scraping is generally easier with XLST 2.0 rather than XQuery. Do you agree?

There doesn't appear to be any samples on stylusstudio.com that offers insight.

Suggestions on how to evaluate Stylus for this task?

Thanks in advance

Minollo I.Subject: Screen scraping using Stylus?
Author: Minollo I.
Date: 19 Apr 2006 11:48 AM
Using Stylus Studio you can take advantage of the HTML to XML converter and feed the HTML as a well formed XML into an XSLT or XQuery. You can take a look at http://www.stylusstudio.com/xml_import_export.html for more details about how to use converters in Stylus Studio.

About the XSLT 2.0 vs. XQuery question, if regular expression handling is important to you for the kind of work you need to do, then maybe XSLT 2.0 is a good choice; you may want to take a look at this Michael Kay's article for a well written comparison of the two languages: http://www.idealliance.org/proceedings/xtech05/papers/02-03-01/

Hope this helps,

Topic Page 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Go to previous topicPrev TopicGo to next topicNext Topic
Download A Free Trial of Stylus Studio 6 XML Professional Edition Today! Powered by Stylus Studio, the world's leading XML IDE for XML, XSLT, XQuery, XML Schema, DTD, XPath, WSDL, XHTML, SQL/XML, and XML Mapping!  

Log In Options

Site Map | Privacy Policy | Terms of Use | Trademarks
Stylus Scoop XML Newsletter:
W3C Member
Stylus Studio® and DataDirect XQuery ™are from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2016 All Rights Reserved.