Computing term translation probabilities with generalized latent semantic analysis

  • Authors:
  • Irina Matveeva;Gina-Anne Levow

  • Affiliations:
  • University of Chicago, Chicago, IL;University of Chicago, Chicago, IL

  • Venue:
  • EACL '06 Proceedings of the Eleventh Conference of the European Chapter of the Association for Computational Linguistics: Posters & Demonstrations
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Term translation probabilities proved an effective method of semantic smoothing in the language modelling approach to information retrieval tasks. In this paper, we use Generalized Latent Semantic Analysis to compute semantically motivated term and document vectors. The normalized cosine similarity between the term vectors is used as term translation probability in the language modelling framework. Our experiments demonstrate that GLSA-based term translation probabilities capture semantic relations between terms and improve performance on document classification.