Parsing the SynTagRus treebank of Russian

  • Authors:
  • Joakim Nivre; Igor M. Boguslavsky; Leonid L. Iomdin

  • Affiliations:
  • Växjö University and Uppsala University; Universidad Politécnica de Madrid; Institute for Information Transmission Problems

  • Venue:
  • COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
  • Year:
  • 2008

Abstract

We present the first results on parsing the SynTagRus treebank of Russian with a data-driven dependency parser, achieving a labeled attachment score of over 82% and an unlabeled attachment score of 89%. A feature analysis shows that high parsing accuracy depends crucially on the use of both lexical and morphological features. We conjecture that the latter result generalizes to richly inflected languages, provided that sufficient training data is available.
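
For readers unfamiliar with the two metrics, the sketch below shows how unlabeled and labeled attachment scores are conventionally computed: the fraction of tokens assigned the correct head, and the fraction assigned both the correct head and the correct dependency label. It is an illustration only, not the evaluation code used in the paper. The function name, the token representation, and the toy labels are assumptions, and details such as whether punctuation tokens are scored are not stated in the abstract.

```python
from typing import List, Tuple

# Assumed representation: one (head_index, dependency_label) pair per token,
# with head index 0 standing for the artificial root node.
Arc = Tuple[int, str]


def attachment_scores(gold: List[Arc], predicted: List[Arc]) -> Tuple[float, float]:
    """Return (UAS, LAS) as fractions in [0, 1] over all tokens."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must cover the same tokens")
    if not gold:
        raise ValueError("cannot score an empty token sequence")

    # UAS counts tokens whose predicted head matches the gold head.
    head_hits = sum(1 for (g_head, _), (p_head, _) in zip(gold, predicted)
                    if g_head == p_head)
    # LAS additionally requires the dependency label to match.
    arc_hits = sum(1 for g, p in zip(gold, predicted) if g == p)

    total = len(gold)
    return head_hits / total, arc_hits / total


# Toy four-token sentence with made-up labels (not SynTagRus relations).
gold = [(2, "subj"), (0, "root"), (2, "obj"), (3, "mod")]
pred = [(2, "subj"), (0, "root"), (2, "mod"), (2, "mod")]

uas, las = attachment_scores(gold, pred)
print(f"UAS = {uas:.2f}, LAS = {las:.2f}")  # prints "UAS = 0.75, LAS = 0.50"
```

In the toy sentence the third token has the right head but the wrong label, so it counts toward UAS but not LAS. This is why the labeled score reported above is lower than the unlabeled one.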