The effect of semi-supervised learning on parsing long distance dependencies in German and Swedish

  • Authors:
  • Anders Søgaard;Christian Rishøj

  • Affiliations:
  • Center for Language Technology, University of Copenhagen, Copenhagen S;Center for Language Technology, University of Copenhagen, Copenhagen S

  • Venue:
  • IceTAL'10 Proceedings of the 7th international conference on Advances in natural language processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper shows how the best data-driven dependency parsers available today [1] can be improved by learning from unlabeled data. We focus on German and Swedish and show that labeled attachment scores improve by 1.5%-2.5%. Error analysis shows that improvements are primarily due to better recovery of long distance dependencies.