Combining knowledge-based methods and supervised learning for effective Italian word sense disambiguation

  • Authors:
  • Pierpaolo Basile;Marco de Gemmis;Pasquale Lops;Giovanni Semeraro

  • Affiliations:
  • University of Bari, Italy;University of Bari, Italy;University of Bari, Italy;University of Bari, Italy

  • Venue:
  • STEP '08 Proceedings of the 2008 Conference on Semantics in Text Processing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a WSD strategy which combines a knowledge-based method that exploits sense definitions in a dictionary and relations among senses in a semantic network, with supervised learning methods on annotated corpora. The idea behind the approach is that the knowledge-based method can cope with the possible lack of training data, while supervised learning can improve the precision of a knowledge-based method when training data are available. This makes the proposed method suitable for disambiguation of languages for which the available resources are lacking in training data or sense definitions. In order to evaluate the effectiveness of the proposed approach, experimental sessions were carried out on the dataset used for the WSD task in the EVALITA 2007 initiative, devoted to the evaluation of Natural Language Processing tools for Italian. The most effective hybrid WSD strategy is the one that integrates the knowledge-based approach into the supervised learning method, which outperforms both methods taken singularly.