Estimation of consistent probabilistic context-free grammars

  • Authors:
  • Mark-Jan Nederhof;Giorgio Satta

  • Affiliations:
  • Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands;University of Padua, via Gradenigo, Padova, Italy

  • Venue:
  • HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider several empirical estimators for probabilistic context-free grammars, and show that the estimated grammars have the so-called consistency property, under the most general conditions. Our estimators include the widely applied expectation maximization method, used to estimate probabilistic context-free grammars on the basis of unannotated corpora. This solves a problem left open in the literature, since for this method the consistency property has been shown only under restrictive assumptions on the rules of the source grammar.