Using decision trees to construct a practical parser

  • Authors:
  • Masahiko Haruno;Satoshi Shirai;Yoshifumi Ooyama

  • Affiliations:
  • ATR Human Information Processing Research Laboratories, Kyoto, Japan;NTT Communication Science Laboratories, Kyoto, Japan;NTT Communication Science Laboratories, Kyoto, Japan

  • Venue:
  • COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes novel and practical Japanese parsers that uses decision trees. First, we construct a single decision tree to estimate modification probabilities; how one phrase tends to modify another. Next, we introduce a boosting algorithm in which several decision trees are constructed and then combined for probability estimation. The two constructed parsers are evaluated by using the EDR Japanese annotated corpus. The single-tree method outperforms the conventional Japanese stochastic methods by 4%. Moreover, the boosting version is shown to have significant advantages; 1) better parsing accuracy than its single-tree counterpart for any amount of training data and 2) no over-fitting to data for various iterations.