Chunking German: an unsolved problem

  • Authors:
  • Sandra Kübler;Kathrin Beck;Erhard Hinrichs;Heike Telljohann

  • Affiliations:
  • Indiana University, Bloomington, IN;Universität Tübingen, Tübingen, Germany;Universität Tübingen, Tübingen, Germany;Universität Tübingen, Tübingen, Germany

  • Venue:
  • LAW IV '10 Proceedings of the Fourth Linguistic Annotation Workshop
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes a CoNLL-style chunk representation for the Tübingen Treebank of Written German, which assumes a flat chunk structure so that each word belongs to at most one chunk. For German, such a chunk definition causes problems in cases of complex prenominal modification. We introduce a flat annotation that can handle these structures via a stranded noun chunk.