[XML-DEV Mailing List Archive Home] [By Thread] [By Date] [Recent Entries] [Reply To This Message]

Re: Can Searchbots Find Web Pages That Aren't Linked To?

  • From: "Manfred Staudinger" <manfred.staudinger@g...>
  • To: "Pete Cordell" <petexmldev@c...>
  • Date: Sun, 9 Mar 2008 15:37:37 +0100

Re:  Can Searchbots Find Web Pages That Aren't Linked To?
>  From: "Costello, Roger L." <costello@m...>
>
>  I was interested in knowing if searchbots can find web pages that
>  aren't link to.
>
>  So, I conducted a simple experiment:
You asked a simple question and you got a simple answer which is
correct but doesn't really help. The discussion about sitemaps reveals
a more complex picture:
http://code.google.com/support/bin/answer.py?hl=en-uk&answer=40318

a) Rather than asking "if searchbots can find" a web page, I would ask
whether the searchbot has actually found and crawled a web page. This
can be answered by looking at the server log files. If a searchbot
_has_ crawled a web page it may include it in the index (sooner or
later) _or_ not.

b) You are offering 2 pages with duplicate content, one xhtml and one
xml. Now the best choice a search engine can make is to index both but
show only the xhtml to the user. An option to show even the 2nd page
would be perfect ("repeat the search with the omitted results
included.")

Hope this helps,

Manfred


[Date Prev] | [Thread Prev] | [Thread Next] | [Date Next] -- [Date Index] | [Thread Index]


PURCHASE STYLUS STUDIO ONLINE TODAY!

Purchasing Stylus Studio from our online shop is Easy, Secure and Value Priced!

Buy Stylus Studio Now

Cast Your Vote

We need your help – Vote for DataDirect XML Products!

  • Best SOA or XML site

Winners and finalists announced at SOA World Conference in November.

Download The World's Best XML IDE!

Accelerate XML development with our award-winning XML IDE - Download a free trial today!

Don't miss another message! Subscribe to this list today.
Email
First Name
Last Name
Company
Subscribe in XML format
RSS 2.0
Atom 0.3
 

Stylus Studio has published XML-DEV in RSS and ATOM formats, enabling users to easily subcribe to the list from their preferred news reader application.


Stylus Studio Sponsored Links are added links designed to provide related and additional information to the visitors of this website. they were not included by the author in the initial post. To view the content without the Sponsor Links please click here.

Site Map | Privacy Policy | Terms of Use | Trademarks
Free Stylus Studio XML Training:
W3C Member
Stylus Studio® and DataDirect XQuery™are products from DataDirect Technologies, is a registered trademark of Progress Software Corporation, in the U.S. and other countries. © 2004-2007 All Rights Reserved.