Improving retrieval feedback with multiple term-ranking function combination

  • Authors:
  • Claudio Carpineto;Giovanni Romano;Vittorio Giannini

  • Affiliations:
  • Fondazione Ugo Bordoni, Roma, Italy;Fondazione Ugo Bordoni, Roma, Italy;WIND Telecomunicazioni S.p.A, Roma, Italy

  • Venue:
  • ACM Transactions on Information Systems (TOIS)
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this article we consider methods for automatic query expansion from top retrieved documents (i.e., retrieval feedback) that make use of various functions for scoring expansion terms within Rocchio's classical reweighting scheme. An analytical comparison shows that the retrieval performance of methods based on distinct term-scoring functions is comparable on the whole query set but differs considerably on single queries, consistent with the fact that the ordered sets of expansion terms suggested for each query by the different functions are largely uncorrelated. Motivated by these findings, we argue that the results of multiple functions can be merged, by analogy with ensembling classifiers, and present a simple combination technique based on the rank values of the suggested terms. The combined retrieval feedback method is effective not only with respect to unexpanded queries but also to any individual method, with notable improvements on the system's precision. Furthermore, the combined method is robust with respect to variation of experimental parameters and it is beneficial even when the same information needs are expressed with shorter queries.