A maximum entropy-based word sense disambiguation system

  • Authors:
  • Armando Suárez;Manuel Palomar

  • Affiliations:
  • Universidad de Alicante, Alicante, Spain;Universidad de Alicante, Alicante, Spain

  • Venue:
  • COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, a supervised learning system of word sense disambiguation is presented. It is based on conditional maximum entropy models. This system acquires the linguistic knowledge from an annotated corpus and this knowledge is represented in the form of features. Several types of features have been analyzed using the SENSEVAL-2 data for the Spanish lexical sample task. Such analysis shows that instead of training with the same kind of information for all words, each one is more effectively learned using a different set of features. This best-feature-selection is used to build some systems based on different maximum entropy classifiers, and a voting system helped by a knowledge-based method.