The freshness of web search engine databases

  • Authors:
  • Dirk Lewandowski;Henry Wahlig;Gunnar Meyer-Bautor

  • Affiliations:
  • -;-;Department of Information Science, Heinrich-Heine-University Düsseldorf, Germany

  • Venue:
  • Journal of Information Science
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This study measures the frequency with which search engines update their indices. Therefore, 38 websites that are updated on a daily basis were analysed within a time-span of six weeks. The analysed search engines were Google, Yahoo and MSN. We find that Google performs best overall with the most pages updated on a daily basis, but only MSN is able to update all pages within a time-span of less than 20 days. Both other engines have outliers that are older. In terms of indexing patterns, we find different approaches at the different engines. While MSN shows clear update patterns, Google shows some outliers and the update process of the Yahoo index seems to be quite chaotic. Implications are that the quality of different search engine indices varies and more than one engine should be used when searching for current content.