The concept of recall has been one of the key elements of system measurement throughout the history of information retrieval, despite the fact that there are many unanswered questions as to its value. In this essay, we review those questions and explore several further issues that affect the usefulness of recall. In particular, we ask whether it is reasonable to expect to be able to measure recall; whether some researchers are conflating the concepts of recall and answer set cardinality; and whether it is plausible that a user would rely on a belief that a system is "high recall" to deeply explore an answer list. Combined with earlier observations about the unknowability of recall, and the lack of a plausible user model in which recall is a measure of satisfaction, we conclude that use of recall as a measure of the effectiveness of ranked querying is indefensible.
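To make the measurement problem concrete, here is a minimal sketch of the standard recall computation. The function name and document identifiers are illustrative, not from the essay; the point is that the denominator is the size of the *complete* set of relevant documents, which in a large collection is exactly the quantity that cannot ordinarily be known.

```python
def recall(retrieved, relevant):
    """Fraction of all relevant documents that appear in the answer set.

    Computing this requires the full relevant set -- the quantity that is
    generally unknowable for a large collection.
    """
    if not relevant:
        return 0.0
    return len(set(retrieved) & set(relevant)) / len(relevant)

# Hypothetical document IDs for illustration only.
retrieved = ["d1", "d2", "d3", "d4"]
relevant = ["d2", "d4", "d7", "d9", "d11"]
print(recall(retrieved, relevant))  # 2 of 5 relevant documents retrieved -> 0.4
```

Note also that the answer set cardinality (here, four documents) appears nowhere in the formula: a system can return a large answer set and still have low recall, which is why conflating the two concepts is an error.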