Best Feature Selection for Maximum Entropy-Based Word Sense Disambiguation

  • Authors:
  • Armando Suárez;Manuel Palomar

  • Affiliations:
  • -;-

  • Venue:
  • NLDB '02 Proceedings of the 6th International Conference on Applications of Natural Language to Information Systems-Revised Papers
  • Year:
  • 2002

Quantified Score

Hi-index 0.02

Visualization

Abstract

In this paper, a supervised learning method of word sense disambiguation based on maximum entropy conditional probability models is presented. This system acquires the linguistic knowledge from an annotated corpus and this knowledge is represented in the form of features. Several types of features has been analyzed for a few words selected from the DSO corpus. The main contribution of this paper consists of the selection of the best sets of features for each word from the training data in order to build the classifiers. Our experimentation shows that our method reaches a good accuracy when it is compared with, for example, the systems at SENSEVAL-2.