A Method of Pre-computing Connectivity Relations for Japanese/Korean POS Tagging

  • Authors:
  • Kazuaki Ando;Tae-hun Lee;Masami Shishibori;Jun-Ichi Aoe

  • Affiliations:
  • -;-;-;-

  • Venue:
  • CICLing '01 Proceedings of the Second International Conference on Computational Linguistics and Intelligent Text Processing
  • Year:
  • 2001

Quantified Score

Hi-index 0.01

Visualization

Abstract

This paper presents an efficient dictionary structure of Part-of-Speech(POS) Tagging for Japanese/Korean by extending Aho and Corasick's pattern matching machine. The proposed method is a simple and fast algorithm to find all possible morphemes in an input sentence and in a single pass, and it stores the relations of grammatical connectivity of neighboring morphemes into the output functions. Therefore, the proposed method can reduce both costs of the dictionary lookup and the connection check to find the most suitable word segmentation. From the simulation results, it turns out that the proposed method was 21.8% faster (CPU time) than the general approach using the trie structure. Concerning the number of candidates for checking connections, it was 27.4% less than that of the original morphological analysis.