Applying Data Mining to Pseudo-Relevance Feedback for High Performance Text Retrieval

  • Authors:
  • Xiangji Huang;Yan Rui Huang;Miao Wen;Aijun An;Yang Liu;Josiah Poon

  • Affiliations:
  • York University, Canada;York University, Canada;York University, Canada;York University, Canada;York University, Canada;University of Sydney, Australia

  • Venue:
  • ICDM '06 Proceedings of the Sixth International Conference on Data Mining
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we investigate the use of data mining, in particular the text classification and co-training techniques, to identify more relevant passages based on a small set of labeled passages obtained from the blind feedback of a retrieval system. The data mining results are used to expand query terms and to re-estimate some of the parameters used in a probabilistic weighting function. We evaluate the data mining based feedback method on the TREC HARD data set. The results show that data mining can be successfully applied to improve the text retrieval performance. We report our experimental findings in detail.