Building an optimal WSD ensemble using per-word selection of best system

  • Authors:
  • Harri M. T. Saarikoski;Steve Legrand

  • Affiliations:
  • KIT Language Technology Doctorate School, Helsinki University, Finland;Department of Computer Science, University of Jyväskylä, Finland

  • Venue:
  • CIARP'06 Proceedings of the 11th Iberoamerican conference on Progress in Pattern Recognition, Image Analysis and Applications
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In Senseval workshops for evaluating WSD systems [1,4,9], no one system or system type (classifier algorithm, type of system ensemble, extracted feature set, lexical knowledge source etc.) has been discovered that resolves all ambiguous words into their senses in a superior way. This paper presents a novel method for selecting the best system for target word based on readily available word features (number of senses, average amount of training per sense, dominant sense ratio). Applied to Senseval-3 and Senseval-2 English lexical sample state-of-art systems, a net gain of approximately 2.5 – 5.0% (respectively) in average precision per word over the best base system is achieved. The method can be applied to any base system or target word in any language.