Benchmarking of statistical dependency parsers for French

  • Authors:
  • Marie Candito;Joakim Nivre;Pascal Denis;Enrique Henestroza Anguiano

  • Affiliations:
  • Alpage (Université Paris/INRIA);Uppsala University;Alpage (Université Paris/INRIA);Alpage (Université Paris/INRIA)

  • Venue:
  • COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We compare the performance of three statistical parsing architectures on the problem of deriving typed dependency structures for French. The architectures are based on PCFGs with latent variables, graph-based dependency parsing and transition-based dependency parsing, respectively. We also study the influence of three types of lexical information: lemmas, morphological features, and word clusters. The results show that all three systems achieve competitive performance, with a best labeled attachment score over 88%. All three parsers benefit from the use of automatically derived lemmas, while morphological features seem to be less important. Word clusters have a positive effect primarily on the latent variable parser.