Supervised Semantic Indexing

  • Authors:
  • Bing Bai;Jason Weston;Ronan Collobert;David Grangier

  • Affiliations:
  • NEC Labs America, Princeton, USA 08540;NEC Labs America, Princeton, USA 08540;NEC Labs America, Princeton, USA 08540;NEC Labs America, Princeton, USA 08540

  • Venue:
  • ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a class of models that are discriminatively trained to directly map from the word content in a query-document or document- document pair to a ranking score. Like Latent Semantic Indexing (LSI), our models take account of correlations between words (synonymy, pol- ysemy). However, unlike LSI our models are trained with a supervised signal directly on the task of interest, which we argue is the reason for our superior results. We provide an empirical study on Wikipedia documents, using the links to define document-document or query-document pairs, where we obtain state-of-the-art performance using our method.