Unsupervised learning of word sense disambiguation rules by estimating an optimum iteration number in the EM algorithm

  • Authors:
  • Hiroyuki Shinnou;Minoru Sasaki

  • Affiliations:
  • Ibaraki University, Hitachi, Ibaraki, Japan;Ibaraki University, Hitachi, Ibaraki, Japan

  • Venue:
  • CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we improve an unsupervised learning method using the Expectation-Maximization (EM) algorithm proposed by Nigam et al. for text classification problems in order to apply it to word sense disambiguation (WSD) problems. The improved method stops the EM algorithm at the optimum iteration number. To estimate that number, we propose two methods. In experiments, we solved 50 noun WSD problems in the Japanese Dictionary Task in SENSEVAL2. The score of our method is a match for the best public score of this task. Furthermore, our methods were confirmed to be effective also for verb WSD problems.