Comparison of the baseline knowledge-, corpus-, and web-based similarity measures for semantic relations extraction

  • Authors:
  • Alexander Panchenko

  • Affiliations:
  • Université catholique de Louvain, Belgium

  • Venue:
  • GEMS '11 Proceedings of the GEMS 2011 Workshop on GEometrical Models of Natural Language Semantics
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Unsupervised methods of semantic relations extraction rely on a similarity measure between lexical units. Similarity measures differ both in kinds of information they use and in the ways how this information is transformed into a similarity score. This paper is making a step further in the evaluation of the available similarity measures within the context of semantic relation extraction. We compare 21 baseline measures -- 8 knowledge-based, 4 corpus-based, and 9 web-based metrics with the BLESS dataset. Our results show that existing similarity measures provide significantly different results, both in general performances and in relation distributions. We conclude that the results suggest developing a combined similarity measure.