We propose a new application of the Friedman statistical test of significance for comparing multiple retrieval methods. After measuring the average precision at the eleven standard recall levels, our application of the Friedman test provides a global comparison of the methods. In some experiments, this test supplies additional, useful information for deciding whether the methods differ.
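
As a rough illustration of the procedure described above, the following Python sketch applies SciPy's friedmanchisquare to interpolated precision values measured at the eleven standard recall levels. This is a minimal sketch, not the authors' implementation: the three methods and all precision figures are invented for the example, and the choice to treat recall levels as the blocks of the Friedman test is an assumption made for this illustration.

import numpy as np
from scipy.stats import friedmanchisquare

# Hypothetical interpolated precision values for three retrieval methods,
# measured at the eleven standard recall levels (0.0, 0.1, ..., 1.0).
# Each row is one method; each column (recall level) acts as a "block"
# in the Friedman test.
precision = np.array([
    [0.78, 0.71, 0.65, 0.58, 0.52, 0.47, 0.41, 0.35, 0.28, 0.21, 0.15],  # method A
    [0.74, 0.69, 0.62, 0.55, 0.49, 0.44, 0.38, 0.31, 0.25, 0.18, 0.12],  # method B
    [0.80, 0.73, 0.68, 0.60, 0.55, 0.50, 0.44, 0.37, 0.30, 0.23, 0.17],  # method C
])

# The Friedman test ranks the methods within each recall level and asks
# whether the mean ranks differ more than chance alone would allow.
statistic, p_value = friedmanchisquare(*precision)

print(f"Friedman chi-square = {statistic:.3f}, p = {p_value:.4f}")
if p_value < 0.05:
    print("Reject H0: the methods' precision profiles differ.")
else:
    print("No evidence that the methods differ.")

Note that a significant Friedman result is only the global comparison: it indicates that at least one method differs from the others, and a post-hoc procedure (for example, the Nemenyi test) would be needed to identify which pairs of methods actually differ.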