Optimum polynomial retrieval functions based on the probability ranking principle

  • Authors:
  • Norbert Fuhr

  • Affiliations:
  • Technische Hochschule Darmstadt, Darmstadt, W. Germany

  • Venue:
  • ACM Transactions on Information Systems (TOIS)
  • Year:
  • 1989

Quantified Score

Hi-index 0.00

Visualization

Abstract

We show that any approach to developing optimum retrieval functions is based on two kinds of assumptions: first, a certain form of representation for documents and requests, and second, additional simplifying assumptions that predefine the type of the retrieval function. Then we describe an approach for the development of optimum polynomial retrieval functions: request-document pairs (fl, dm) are mapped onto description vectors x(fl, dm), and a polynomial function e(x) is developed such that it yields estimates of the probability of relevance P(R | x (fl, dm) with minimum square errors. We give experimental results for the application of this approach to documents with weighted indexing as well as to documents with complex representations. In contrast to other probabilistic models, our approach yields estimates of the actual probabilities, it can handle very complex representations of documents and requests, and it can be easily applied to multivalued relevance scales. On the other hand, this approach is not suited to log-linear probabilistic models and it needs large samples of relevance feedback data for its application.