A portable & quick Japanese parser: QJP

  • Authors:
  • Masayuki Kameda

  • Affiliations:
  • Ricoh Company, LTD., Yokohama, Japan

  • Venue:
  • COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
  • Year:
  • 1996

Quantified Score

Hi-index 0.00

Visualization

Abstract

QJP is a portable and quick software module for Japanese processing. QJP analyzes a Japanese sentence into segmented morphemes/words with tags and a syntactic bunsetsu kakari-uke structure based on the two strategies, a) Morphological analysis based on character-types and functional-words and b) Syntactic analysis by simple treatment of structural ambiguities and ignoring semantic information. QJP is small, fast and robust, because 1) dictionary size (less than 100KB) and required memory size (260KB) are very small, 2) analysis speed is fast (more than 100 words/sec on 80486-PC), and 3) even a 100-word long sentence containing unknown words is easily processed.Using QJP and its analysis results as a base and adding other functions for processing Japanese documents, a variety of applications can be developed on UNIX workstations or even on PCs.