An empirical study of SLDA for information retrieval

  • Authors:
  • Dashun Ma;Lan Rao;Ting Wang

  • Affiliations:
  • College of Computer, National University of Defense Technology, Changsha, Hunan, P.R. China;College of Humanities and Social Sciences, National University of Defense Technology, Changsha, Hunan, P.R. China;College of Computer, National University of Defense Technology, Changsha, Hunan, P.R. China

  • Venue:
  • AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

A common limitation of many language modeling approaches is that retrieval scores are mainly based on exact matching of terms in the queries and documents, ignoring the semantic relations among terms. Latent Dirichlet Allocation (LDA) is an approach trying to capture the semantic dependencies among words. However, using as document representation, LDA has no successful applications in information retrieval (IR). In this paper, we propose a single-document-based LDA (SLDA) document model for IR. The proposed work has been evaluated on four TREC collections, which shows that SLDA document modeling method is comparable to the state-of-the-art language modeling approaches, and it's a novel way to use LDA model to improve retrieval performance.