Promoting divergent terms in the estimation of relevance models

  • Authors:
  • Javier Parapar;Álvaro Barreiro

  • Affiliations:
  • IRLab, Computer Science Department, University of A Coruña, Spain;IRLab, Computer Science Department, University of A Coruña, Spain

  • Venue:
  • ICTIR'11 Proceedings of the Third international conference on Advances in information retrieval theory
  • Year:
  • 2011

Abstract

Pseudo-relevance feedback (PRF) techniques for query expansion have traditionally proven very effective. In particular, the use of Relevance Models (RM) within the Language Modelling framework has become established as a high-performance approach to beat. In this paper we present an alternative estimation of the RM that promotes terms which are present in the relevance set and are also distant from the language model of the collection. We compared this approach with RM3 and with an adaptation of Rocchio's KLD-based term ranking function to the Language Modelling framework. The evaluation showed that this alternative estimation of the RM yields consistently better results than RM3 and is, on average, the most stable across collections in terms of robustness.
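
To make the notion of a term being "distant from the language model of the collection" concrete, the following is a minimal Python sketch of a generic KLD-style term score, p(t|R) · log(p(t|R) / p(t|C)), computed from maximum-likelihood estimates. It is an illustration only and is not the estimation proposed in the paper, whose details the abstract does not give. The function name `kld_term_scores`, the toy documents, and the choice of unsmoothed estimates are all assumptions made for this example.

```python
import math
from collections import Counter


def kld_term_scores(feedback_docs, collection_docs):
    """Score feedback terms by their KL-divergence contribution.

    Hypothetical helper for illustration: each document is a list of tokens,
    and probabilities are plain maximum-likelihood estimates (no smoothing).
    """
    rel_counts = Counter(t for doc in feedback_docs for t in doc)
    col_counts = Counter(t for doc in collection_docs for t in doc)
    rel_total = sum(rel_counts.values())
    col_total = sum(col_counts.values())

    scores = {}
    for term, count in rel_counts.items():
        if col_counts[term] == 0:
            # Assumption: feedback documents come from the collection,
            # so unseen terms are simply skipped here.
            continue
        p_rel = count / rel_total             # p(t | R), feedback set model
        p_col = col_counts[term] / col_total  # p(t | C), collection model
        scores[term] = p_rel * math.log(p_rel / p_col)
    return scores


# Toy collection of four tokenised documents (made up for this sketch).
collection = [
    ["the", "cat", "sat"],
    ["the", "dog", "ran"],
    ["the", "cat", "purred"],
    ["the", "stock", "fell"],
]
# Pretend the first and third documents were retrieved as pseudo-relevant.
feedback = [collection[0], collection[2]]

scores = kld_term_scores(feedback, collection)
ranked = sorted(scores, key=scores.get, reverse=True)
```

In this toy example "the" is as frequent in the feedback set as in the collection, so its score is zero, while "cat" is over-represented in the feedback set and ranks first. A term that is common everywhere is therefore not promoted, which is the intuition behind favouring divergent terms.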