How Search Engines Find Documents

Category: Site Promotion
Date: 28 April 2008 07:00

 by: Kamlesh Patel

Every document on the Web is associated with a URL (Uniform Resource Locator). In this context, we will use the terms document and URL interchangeably. This is an oversimplification, as some URLs return different documents depending on such factors as the user's location, browser type, or form input, but the terminology suits our purposes for now.

Finding every document on the Web would therefore mean more than finding every URL on the Web. For this reason, search engines do not currently attempt to locate every possible unique document, although research is always underway in this area. Instead, crawling search engines focus their attention on unique URLs: even if a dynamic site displays different content at the same URL (via form inputs or other dynamic variables), a search engine will treat that URL as a single page.
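As a rough illustration of treating the URL as the unit of uniqueness, the sketch below reduces each URL to a canonical key and skips any key it has already seen. The function names and the normalization rules (lowercasing the scheme and host, dropping the fragment) are assumptions made for this example, not a description of how any particular search engine works.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize_url(url: str) -> str:
    """Build a canonical key: lowercase scheme and host, no fragment."""
    parts = urlsplit(url)
    path = parts.path or "/"  # treat an empty path as the site root
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), path, parts.query, "")
    )


def is_new_url(url: str, seen: set) -> bool:
    """Return True the first time a URL's canonical key is seen."""
    key = normalize_url(url)
    if key in seen:
        return False
    seen.add(key)
    return True
```

Whatever content the server returns for that key on a given request, a crawler built this way records it as one page.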

The typical crawling search engine uses three main resources to build a list of URLs to crawl. Not all search engines use all of these:

Hyperlinks on existing Web pages

The bulk of the URLs found in the databases of most crawling search engines consists of links found on Web pages that the spider has already crawled. Finding a link to a document on one page implies that someone found that link important enough to add it to their page.
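A minimal sketch of how a spider might collect links from a page it has already fetched, using only the Python standard library. The class and function names are hypothetical, and a real crawler would also handle redirects, robots.txt rules, and malformed markup.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collect the href of every <a> tag, resolved against the page URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Relative links become absolute URLs the crawler can fetch.
                self.links.append(urljoin(self.base_url, value))


def extract_links(html, base_url):
    """Return the absolute URLs linked from an HTML document."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links
```

Each URL returned here would be a candidate for the list of URLs to crawl.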

Submitted URLs

All the crawling search engines have some sort of process that allows users or Website owners to submit URLs to be crawled. In the past, all search engines offered a free manual submission process, but now many accept only paid submissions. Google is a notable exception, with no apparent plans to stop accepting free submissions, although it is doubtful whether submitting a URL has any real effect on whether it gets crawled.

XML data feeds

Paid inclusion programs, such as the Yahoo! Site Match system, include trusted feed programs that allow sites to submit XML-based content summaries for crawling and inclusion. As the Semantic Web begins to emerge, and more sites begin to offer RSS (RDF Site Summary) news feed files, some search engines have begun to read these files in order to find fresh content.
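To illustrate how a feed file can supply fresh URLs, here is a small sketch that pulls the item links out of a feed. It assumes the simple RSS 2.0 layout (rss, channel, item, link); RDF-based RSS 1.0 feeds use XML namespaces and would need different lookups. The function name is made up for this example.

```python
import xml.etree.ElementTree as ET


def feed_links(rss_text):
    """Return the <link> URL of every <item> in an RSS 2.0 document."""
    root = ET.fromstring(rss_text)
    links = []
    # Only item-level links are collected; the channel's own link is skipped.
    for link in root.findall("./channel/item/link"):
        if link.text and link.text.strip():
            links.append(link.text.strip())
    return links
```

The URLs found this way could then be handed to the same crawl list as links discovered on Web pages.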

Search engines run multiple crawler programs, and each crawler program (or spider) receives instructions from the scheduler about which URL (or set of URLs) to fetch next. We will see how search engines manage the scheduling process shortly, but first, let's take a look at how a search engine's crawler program works.
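The relationship between the scheduler and its crawlers can be pictured with a toy example. The class below is purely an assumption for illustration: it hands out URLs in first-in, first-out order and never repeats one, whereas a real scheduler would weigh priority, freshness, and politeness toward each site.

```python
from collections import deque


class Scheduler:
    """Toy scheduler: a FIFO queue of URLs that have not been handed out yet."""

    def __init__(self, seeds=()):
        self._queue = deque()
        self._seen = set()
        for url in seeds:
            self.add(url)

    def add(self, url):
        """Queue a URL unless it has been queued before."""
        if url not in self._seen:
            self._seen.add(url)
            self._queue.append(url)

    def next_batch(self, size=1):
        """Give a crawler up to `size` URLs to fetch next."""
        batch = []
        while self._queue and len(batch) < size:
            batch.append(self._queue.popleft())
        return batch
```

Each crawler would call next_batch, fetch those URLs, and pass any newly discovered links back through add.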

Source: http://www.elitedatasolution.com

About The Author

Kamlesh Patel

I'm a freelance search engine optimization expert from India. We provide search engine optimization services, including link building, meta tags, etc.

info@elitedatasolution.com

 
