How bad is the problem of PP-attachment? A comparison of English, German and Swedish

  • Authors: Martin Volk
  • Affiliations: Stockholm University, Stockholm, Sweden
  • Venue: Prepositions '06 Proceedings of the Third ACL-SIGSEM Workshop on Prepositions
  • Year: 2006

Abstract

The correct attachment of prepositional phrases (PPs) is a central disambiguation problem in parsing natural languages. This paper compares the baseline situation in English, German and Swedish based on manual PP attachments in various treebanks for these languages. We argue that cross-language comparisons of the disambiguation results in previous research are impossible because of the different selection procedures used when building the training and test sets. We perform uniform treebank queries and show that English has the highest noun attachment rate, followed by Swedish and German. We also show that the high rate in English is dominated by the preposition "of". From our study we derive a list of criteria for profiling data sets for PP attachment experiments.
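As a rough illustration of the quantity the abstract compares, the following minimal sketch computes a noun attachment rate as the share of noun-attached PPs among all PPs attached to either a noun or a verb, once over all prepositions and once with "of" excluded. The function name, the `(preposition, attachment)` tuple format, and the toy cases are assumptions made for this example; they are not the paper's treebank queries or its data.

```python
from collections import Counter


def noun_attachment_rate(cases, exclude_preps=()):
    """Return the fraction of noun-attached PPs among the given cases.

    `cases` is an iterable of (preposition, attachment) pairs, where
    attachment is "N" (noun) or "V" (verb). This format is hypothetical;
    a real treebank query would have to produce it.
    """
    excluded = {p.lower() for p in exclude_preps}
    counts = Counter(
        attachment
        for prep, attachment in cases
        if prep.lower() not in excluded
    )
    total = counts["N"] + counts["V"]
    if total == 0:
        return None  # no ambiguous cases left after filtering
    return counts["N"] / total


# Invented toy cases, for illustration only (not taken from any treebank).
toy_cases = [
    ("of", "N"), ("of", "N"), ("of", "N"),
    ("in", "V"), ("in", "N"),
    ("with", "N"), ("on", "V"), ("for", "V"),
]

overall = noun_attachment_rate(toy_cases)
without_of = noun_attachment_rate(toy_cases, exclude_preps={"of"})
print(f"overall: {overall:.3f}, without 'of': {without_of:.3f}")
```

Reporting the rate both with and without a single dominant preposition is one simple way to profile a data set before comparing results across languages.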