An empirical study on rule granularity and unification interleaving toward an efficient unification-based parsing system

  • Authors:
  • Masaaki Nagata

  • Affiliations:
  • ATR Interpreting Telephony Research Laboratories, Kyoto, Japan

  • Venue:
  • COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 1
  • Year:
  • 1992

Quantified Score

Hi-index 0.02

Visualization

Abstract

This paper describes an empirical study on the optimal granularity of the phrase structure rules and the optimal strategy for interleaving CFG parsing with unification in order to implement an efficient unification-based parsing system. We claim that using "medium-grained" CFG phrase structure rules, which balance the computational cost of CFG parsing and unification, are a cost-effective solution for making unification-based grammar both efficient and easy to maintain. We also claim that "late unification", which delays unification until a complete CFG parse is found, saves unnecessary copies of DAGs for irrelevant subparses and improves performance significantly. The effectiveness of these methods was proved in an extensive experiment. The results show that, on average, the proposed system parses 3.5 times faster than our previous one. The grammar and the parser described in this paper are fully implemented and used as the Japanese analysis module in SL-TRANS, the speech-to-speech translation system of ATR.