Improving dependency parsing with subtrees from auto-parsed data

  • Authors:
  • Wenliang Chen;Jun'ichi Kazama;Kiyotaka Uchimoto;Kentaro Torisawa

  • Affiliations:
  • National Institute of Information and Communications Technology, Soraku-gun, Kyoto, Japan;National Institute of Information and Communications Technology, Soraku-gun, Kyoto, Japan;National Institute of Information and Communications Technology, Soraku-gun, Kyoto, Japan;National Institute of Information and Communications Technology, Soraku-gun, Kyoto, Japan

  • Venue:
  • EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a simple and effective approach to improve dependency parsing by using subtrees from auto-parsed data. First, we use a baseline parser to parse large-scale unannotated data. Then we extract subtrees from dependency parse trees in the auto-parsed data. Finally, we construct new subtree-based features for parsing algorithms. To demonstrate the effectiveness of our proposed approach, we present the experimental results on the English Penn Treebank and the Chinese Penn Treebank. These results show that our approach significantly outperforms baseline systems. And, it achieves the best accuracy for the Chinese data and an accuracy which is competitive with the best known systems for the English data.