Penn: using word similarities to better estimate sentence similarity

  • Authors:
  • Sneha Jha;H. Andrew Schwartz;Lyle H. Ungar

  • Affiliations:
  • University of Pennsylvania Philadelphia, PA;University of Pennsylvania Philadelphia, PA;University of Pennsylvania Philadelphia, PA

  • Venue:
  • SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present the Penn system for SemEval-2012 Task 6, computing the degree of semantic equivalence between two sentences. We explore the contributions of different vector models for computing sentence and word similarity: Collobert and Weston embeddings as well as two novel approaches, namely eigen-words and selectors. These embeddings provide different measures of distributional similarity between words, and their contexts. We used regression to combine the different similarity measures, and found that each provides partially independent predictive signal above baseline models.