Combining diverse word-alignment symmetrizations improves dependency tree projection

  • Authors:
  • David Mareček

  • Affiliations:
  • Charles University in Prague, Institute of Formal and Applied Linguistics

  • Venue:
  • CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part I
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

For many languages, we are not able to train any supervised parser, because there are no manually annotated data available. This problem can be solved by using a parallel corpus with English, parsing the English side, projecting the dependencies through word-alignment connections, and training a parser on the projected trees. In this paper, we introduce a simple algorithm using a combination of various wordalignment symmetrizations. We prove that our method outperforms previous work, even though it uses McDonald's maximum-spanning-tree parser as it is, without any "unsupervised" modifications.