DCG induction using MDL and parsed corpora

  • Authors:
  • Miles Osborne

  • Affiliations:
  • Univ. of Groningen, Groningen, The Netherlands

  • Venue:
  • Learning language in logic
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

We show how partial models of natural language syntax (manually written DCGs, with parameters estimated from a parsed corpus) can be automatically extended when trained upon raw text (using MDL). We also show how we can use a parsed corpus as an alternative constraint upon learning. Empirical evaluation suggests that a parsed corpus is more informative than a MDL-based prior. However, best results are achieved when the learner is supervised with a compressionbased prior and a parsed corpus.