Conceptual information-based sense disambiguation

  • Authors:
  • You-Jin Chung;Kyonghi Moon;Jong-Hyeok Lee

  • Affiliations:
  • Div. of Electrical and Computer Engineering, POSTECH and AITrc, Pohang, R. of Korea;Div. of Computer and Information Engineering, Silla University, Busan, R. of Korea;Div. of Electrical and Computer Engineering, POSTECH and AITrc, Pohang, R. of Korea

  • Venue:
  • IJCNLP'04 Proceedings of the First International Joint Conference on Natural Language Processing
  • Year:
  • 2004

Abstract

Most previous corpus-based approaches to word sense disambiguation (WSD) collect salient words from the context of a target word, but they suffer from data sparseness. To overcome this problem, this paper proposes a concept-based WSD method that uses an automatically generated sense-tagged corpus. Because Korean and Japanese are grammatically similar, a sense-tagged Korean corpus can be constructed with an existing high-quality Japanese-to-Korean machine translation system. The sense-tagged corpus then serves as a knowledge source from which useful clues for WSD, such as concept co-occurrence information, can be extracted. In an evaluation, a weighted voting model achieved the best average precision of 77.22%, an improvement of 14.47% over the baseline, which shows that our proposed method is promising for practical MT systems.
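
As a rough illustration of how concept co-occurrence information could feed a weighted vote, the sketch below may help. It is not the authors' model: the abstract does not give the weighting scheme, so the relative-frequency weights, the most-frequent-sense fallback, the function names (`train`, `disambiguate`), and the toy senses and concept codes are all assumptions made for this example.

```python
from collections import Counter, defaultdict


def train(tagged_examples):
    """Collect concept co-occurrence counts from a sense-tagged corpus.

    tagged_examples: iterable of (sense, [context_concept, ...]) pairs for
    one ambiguous target word (hypothetical input format).
    """
    cooc = defaultdict(Counter)  # context concept -> counts per sense
    sense_freq = Counter()       # how often each sense was observed
    for sense, concepts in tagged_examples:
        sense_freq[sense] += 1
        for concept in set(concepts):  # count each concept once per example
            cooc[concept][sense] += 1
    return cooc, sense_freq


def disambiguate(context_concepts, cooc, sense_freq):
    """Pick a sense by weighted voting over the context concepts.

    Assumed weighting: each known concept casts a vote for every sense it
    co-occurred with, weighted by that sense's relative frequency for the
    concept. Unknown concepts are ignored.
    """
    votes = Counter()
    for concept in context_concepts:
        counts = cooc.get(concept)
        if not counts:
            continue
        total = sum(counts.values())
        for sense, n in counts.items():
            votes[sense] += n / total
    if not votes:
        # Assumed fallback: most frequent sense in the training data.
        return sense_freq.most_common(1)[0][0]
    # Ties on vote weight are broken by overall sense frequency.
    return max(votes, key=lambda s: (votes[s], sense_freq[s]))


# Toy data: invented senses and concept codes for one ambiguous word.
examples = [
    ("pear", ["FOOD", "TASTE"]),
    ("pear", ["FOOD"]),
    ("ship", ["VEHICLE", "SEA"]),
    ("ship", ["SEA", "FOOD"]),
    ("ship", ["SEA"]),
]
cooc, sense_freq = train(examples)
print(disambiguate(["FOOD", "TASTE"], cooc, sense_freq))
print(disambiguate(["SEA"], cooc, sense_freq))
```

Voting over concept codes rather than surface words is what is meant to ease data sparseness: many distinct context words map to the same concept, so each concept accumulates more evidence than any single word would.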