Risk-Aware Information Retrieval

  • Authors:
  • Jianhan Zhu;Jun Wang;Michael Taylor;Ingemar J. Cox

  • Affiliations:
  • University College London, Ipswich, U.K.;University College London, Ipswich, U.K.;Microsoft Research, Cambridge, U.K.;University College London, Ipswich, U.K.

  • Venue:
  • ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Probabilistic retrieval models usually rank documents based on a scalar quantity. However, such models lack any estimate for the uncertainty associated with a document's rank. Further, such models seldom have an explicit utility (or cost) that is optimized when ranking documents. To address these issues, we take a Bayesian perspective that explicitly considers the uncertainty associated with the estimation of the probability of relevance, and propose an asymmetric cost function for document ranking. Our cost function has the advantage of adjusting the risk in document retrieval via a single parameter for any probabilistic retrieval model. We use the logit model to transform the document posterior distribution with probability space [0,1] into a normal distribution with variable space ( *** *** , + *** ). We apply our risk adjustment approach to a language modelling framework for risk adjustable document ranking. Our experimental results show that our risk-aware model can significantly improve the performance of language models, both with and without background smoothing. When our method is applied to a language model without background smoothing, it can perform as well as the Dirichlet smoothing approach.