Google, bing and a new perspective on ranking similarity

  • Authors:
  • Bruno Cardoso;João Magalhães

  • Affiliations:
  • Centro de Informática e Tecnologias da Informação, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Lisboa, Portugal;Centro de Informática e Tecnologias da Informação, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa, Lisboa, Portugal

  • Venue:
  • Proceedings of the 20th ACM international conference on Information and knowledge management
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose a framework to characterize and compare two search engine results. Typical user-queries are ambiguous and, consequentially, each search engine will compute ranks in different manners, attempting to answer them in the best possible way. Thus, each search engine will have its own bias. Given the importance of the first page results in Web Search Engines, in this paper we propose a framework to assess the information presented in the first page by measuring the information entropy and the correlations between two ranks. Employing the recently proposed Rank-Biased Overlap measure [2] we compare to which extent do Bing and Google rankings in fact differ. We also extend this measure and propose a measure for comparing the information entropy present in two ranks. The proposed measure is based on the correlation of two ranks and the application of Jensen-Shannon's divergence among two document sets. Our methodology starts with 40,000 user queries and crawls the search results for these queries on both search engines. The results allow us to determine the search engines correlations, crawling coverage, information overlap, and information entropy.