A study on query expansion based on topic distributions of retrieved documents

  • Authors:
  • Midori Serizawa;Ichiro Kobayashi

  • Affiliations:
  • Advanced Sciences, Faculty of Sciences, Ochanomizu University, Tokyo, Japan;Advanced Sciences, Faculty of Sciences, Ochanomizu University, Tokyo, Japan

  • Venue:
  • CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a new relevance feedback (RF) method that uses latent topic information extracted from target documents.In the method, we extract latent topics of the target documents by means of latent Dirichlet allocation (LDA) and expand the initial query by providing the topic distributions of the documents retrieved at the first search. We conduct experiments for retrieving information by our proposed method and confirm that our proposed method is especially useful when the precision of the first search is low. Furthermore, we discuss the cases where RF based on latent topic information and RF based on surface information, i.e., word frequency, work well, respectively.