A universal method of information retrieval evaluation: the "missing" link M and the universal IR surface

Authors:
L. Egghe
Affiliations:
LUC, Universitaire Campus, B-3590 Diepenbeek, Belgium
Venue:
Information Processing and Management: an International Journal
Year:
2004

Abstract

The paper shows that current evaluation measures in information retrieval (chiefly recall R and precision P, and in some cases fallout F) lack universal comparability, in the sense that their values depend on the generality of the IR problem. The solution is to use all "parts" of the database, including the non-relevant documents and the not-retrieved documents. This leads to the measure M, the fraction of the not-retrieved documents that are relevant (hence the "miss" measure). We prove that, independent of the IR problem or of the IR action, the quadruple (P, R, F, M) belongs to a universal IR surface that is the same for all IR activities. This universality is then exploited to define a new evaluation measure for IR that allows unbiased comparisons of all IR results. We also show that using only one, two or even three measures from the set {P, R, F, M} necessarily leads to evaluation measures that are non-universal and hence not capable of comparing different IR situations.
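To make the four measures concrete, the following is a minimal Python sketch, not the paper's own formulation. It assumes the standard 2x2 contingency table of a retrieval result, with hypothetical cell names a (retrieved and relevant), b (retrieved and non-relevant), c (not retrieved and relevant) and d (not retrieved and non-relevant), and the usual definitions of precision, recall and fallout alongside the miss measure M described in the abstract. The function names `ir_measures`, `odds` and `surface_product` are illustrative. The check at the end relies on an identity that follows directly from these definitions: the odds of P and F, divided by the odds of R and M, always multiply to 1. Whether this is the exact form in which the paper states its universal IR surface is an assumption here.

```python
from fractions import Fraction


def ir_measures(a, b, c, d):
    """Return (P, R, F, M) from the four cells of a retrieval contingency table.

    a: retrieved and relevant        b: retrieved and non-relevant
    c: not retrieved and relevant    d: not retrieved and non-relevant
    (cell names are illustrative, not taken from the paper)
    """
    P = Fraction(a, a + b)  # precision: share of retrieved documents that are relevant
    R = Fraction(a, a + c)  # recall: share of relevant documents that are retrieved
    F = Fraction(b, b + d)  # fallout: share of non-relevant documents that are retrieved
    M = Fraction(c, c + d)  # miss: share of not-retrieved documents that are relevant
    return P, R, F, M


def odds(x):
    """Odds x / (1 - x); requires 0 < x < 1, i.e. all four cells non-zero."""
    return x / (1 - x)


def surface_product(P, R, F, M):
    """odds(P) * odds(F) / (odds(R) * odds(M)).

    In cell terms this is (a/b) * (b/d) / ((a/c) * (c/d)), which cancels to 1.
    """
    return odds(P) * odds(F) / (odds(R) * odds(M))


if __name__ == "__main__":
    # Made-up example: 1000 documents, 40 relevant, 50 retrieved.
    P, R, F, M = ir_measures(a=30, b=20, c=10, d=940)
    print("P, R, F, M =", P, R, F, M)
    print("surface product =", surface_product(P, R, F, M))
```

Exact rational arithmetic (`fractions.Fraction`) is used so that the identity holds exactly rather than up to floating-point error. Because the product equals 1 for any table with non-zero cells, any three of the four measures determine the fourth under these definitions.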