Back to the roots: mean-variance analysis of relevance estimations

Authors:
Guido Zuccon; Leif Azzopardi; Keith van Rijsbergen
Affiliations:
School of Computing Science, University of Glasgow; School of Computing Science, University of Glasgow; School of Computing Science, University of Glasgow
Venue:
ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
Year:
2011

Abstract

Recently, mean-variance analysis has been proposed as a novel paradigm to model document ranking in Information Retrieval. The main merit of this approach is that it diversifies the ranking of retrieved documents. In its original formulation, the strategy considers both the mean of the relevance estimates of retrieved documents and their variance. However, when this strategy was empirically instantiated, the concepts of mean and variance were discarded in favour of a point-wise estimate of relevance (replacing the mean) and either a parameter to be tuned or a quantity dependent on document length (replacing the variance). In this paper we revisit this ranking strategy by going back to its roots: mean and variance. For each retrieved document, we infer a relevance distribution from a series of point-wise relevance estimates provided by a number of different systems. This distribution is used to compute the mean and the variance of the document's relevance estimates. On the TREC ClueWeb collection, we show that this approach improves retrieval performance. This development could lead to new strategies for fusing relevance estimates provided by different systems.
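
The idea of deriving a mean and a variance from several systems' point-wise estimates can be illustrated with a minimal sketch. It is not the paper's formulation, which the abstract does not spell out. It assumes that each system's scores have already been normalised to a common scale, that the sample of scores stands in for the relevance distribution, and that documents are ranked by a simplified score, mean minus `b` times variance, where `b` is a hypothetical risk-aversion parameter. The function name `mean_variance_rank` and the example scores are invented for illustration.

```python
from statistics import mean, pvariance


def mean_variance_rank(estimates, b=1.0):
    """Rank documents by mean relevance penalised by variance.

    estimates: dict mapping a document id to a list of relevance scores,
               one per system (assumed to be on a common scale).
    b:         hypothetical risk-aversion weight; b = 0 ranks by mean only.
    Returns a list of (doc_id, score, mean, variance), best first.
    """
    scored = []
    for doc_id, scores in estimates.items():
        mu = mean(scores)        # mean of the systems' estimates
        var = pvariance(scores)  # population variance of the estimates
        scored.append((doc_id, mu - b * var, mu, var))
    # Sort by the penalised score, highest first.
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored


# Invented scores from three hypothetical systems.
estimates = {
    "d1": [0.9, 0.1, 0.5],  # systems disagree strongly: high variance
    "d2": [0.5, 0.5, 0.5],  # systems agree exactly: zero variance
    "d3": [0.6, 0.7, 0.8],  # high mean, low variance
}

results = mean_variance_rank(estimates, b=1.0)
ranking = [doc_id for doc_id, *_ in results]
print(ranking)
```

In this toy data `d1` and `d2` have the same mean, so the variance term alone decides their relative order: the document on which the systems agree is placed above the one on which they disagree.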