Knowledge sources for constituent parsing of german, a morphologically rich and less-configurational language

  • Authors:
  • Alexander Fraser;Helmut Schmid;Richárd Farkas;Renjing Wang;Hinrich Schütze

  • Affiliations:
  • Institute for NLP, University of Stuttgart;Institute for NLP, University of Stuttgart;Institute for NLP, University of Stuttgart;Institute for NLP, University of Stuttgart;Institute for NLP, University of Stuttgart

  • Venue:
  • Computational Linguistics
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

We study constituent parsing of German, a morphologically rich and less-configurational language. We use a probabilistic context-free grammar treebank grammar that has been adapted to the morphologically rich properties of German by markovization and special features added to its productions. We evaluate the impact of adding lexical knowledge. Then we examine both monolingual and bilingual approaches to parse reranking. Our reranking parser is the new state of the art in constituency parsing of the TIGER Treebank. We perform an analysis, concluding with lessons learned, which apply to parsing other morphologically rich and less-configurational languages.