Evaluating and optimizing the parameters of an unsupervised graph-based WSD algorithm

  • Authors:
  • Eneko Agirre;David Martínez;Oier López de Lacalle;Aitor Soroa

  • Affiliations:
  • University of Basque Country, Donostia, Basque Contry;University of Basque Country, Donostia, Basque Contry;University of Basque Country, Donostia, Basque Contry;University of Basque Country, Donostia, Basque Contry

  • Venue:
  • TextGraphs-1 Proceedings of the First Workshop on Graph Based Methods for Natural Language Processing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Véronis (2004) has recently proposed an innovative unsupervised algorithm for word sense disambiguation based on small-world graphs called HyperLex. This paper explores two sides of the algorithm. First, we extend Véronis' work by optimizing the free parameters (on a set of words which is different to the target set). Second, given that the empirical comparison among unsupervised systems (and with respect to supervised systems) is seldom made, we used hand-tagged corpora to map the induced senses to a standard lexicon (WordNet) and a publicly available gold standard (Senseval 3 English Lexical Sample). Our results for nouns show that thanks to the optimization of parameters and the mapping method, HyperLex obtains results close to supervised systems using the same kind of bag-of-words features. Given the information loss inherent in any mapping step and the fact that the parameters were tuned for another set of words, these are very interesting results.