Guessing parts-of-speech of unknown words using global information

  • Authors:
  • Tetsuji Nakagawa;Yuji Matsumoto

  • Affiliations:
  • Oki Electric Industry Co., Ltd., Chuo-ku, Osaka, Japan;Nara Institute of Science and Technology, Ikoma, Nara, Japan

  • Venue:
  • ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present a method for guessing POS tags of unknown words using local and global information. Although many existing methods use only local information (i.e. limited window size or intra-sentential features), global information (extra-sentential features) provides valuable clues for predicting POS tags of unknown words. We propose a probabilistic model for POS guessing of unknown words using global information as well as local information, and estimate its parameters using Gibbs sampling. We also attempt to apply the model to semi-supervised learning, and conduct experiments on multiple corpora.