Elements of information theory
Elements of information theory
Accessibility of information on the Web
intelligence
Real life, real users, and real needs: a study and analysis of user queries on the web
Information Processing and Management: an International Journal
Determining the user intent of web search engine queries
Proceedings of the 16th international conference on World Wide Web
Comparing rankings of search results on the Web
Information Processing and Management: an International Journal - Special issue: Infometrics
A similarity measure for indefinite rankings
ACM Transactions on Information Systems (TOIS)
Web search solved?: all result rankings the same?
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Hi-index | 0.00 |
In this paper, we propose a framework to characterize and compare two search engine results. Typical user-queries are ambiguous and, consequentially, each search engine will compute ranks in different manners, attempting to answer them in the best possible way. Thus, each search engine will have its own bias. Given the importance of the first page results in Web Search Engines, in this paper we propose a framework to assess the information presented in the first page by measuring the information entropy and the correlations between two ranks. Employing the recently proposed Rank-Biased Overlap measure [2] we compare to which extent do Bing and Google rankings in fact differ. We also extend this measure and propose a measure for comparing the information entropy present in two ranks. The proposed measure is based on the correlation of two ranks and the application of Jensen-Shannon's divergence among two document sets. Our methodology starts with 40,000 user queries and crawls the search results for these queries on both search engines. The results allow us to determine the search engines correlations, crawling coverage, information overlap, and information entropy.