Parser showdown at the wall street corral: an empirical investigation of error types in parser output

  • Authors:
  • Jonathan K. Kummerfeld;David Hall;James R. Curran;Dan Klein

  • Affiliations:
  • University of California, Berkeley, CA;University of California, Berkeley, CA;University of Sydney, Sydney, NSW, Australia;University of California, Berkeley, CA

  • Venue:
  • EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Constituency parser performance is primarily interpreted through a single metric, F-score on WSJ section 23, that conveys no linguistic information regarding the remaining errors. We classify errors within a set of linguistically meaningful types using tree transformations that repair groups of errors together. We use this analysis to answer a range of questions about parser behaviour, including what linguistic constructions are difficult for state-of-the-art parsers, what types of errors are being resolved by rerankers, and what types are introduced when parsing out-of-domain text.