Online acquisition of Japanese unknown morphemes using morphological constraints

  • Authors:
  • Yugo Murawaki;Sadao Kurohashi

  • Affiliations:
  • Kyoto University, Yoshida-honmachi, Sakyo-ku, Kyoto, Japan;Kyoto University, Yoshida-honmachi, Sakyo-ku, Kyoto, Japan

  • Venue:
  • EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a novel lexicon acquirer that works in concert with the morphological analyzer and has the ability to run in online mode. Every time a sentence is analyzed, it detects unknown morphemes, enumerates candidates and selects the best candidates by comparing multiple examples kept in the storage. When a morpheme is unambiguously selected, the lexicon acquirer updates the dictionary of the analyzer, and it will be used in subsequent analysis. We use the constraints of Japanese morphology and effectively reduce the number of examples required to acquire a morpheme. Experiments show that unknown morphemes were acquired with high accuracy and improved the quality of morphological analysis.