A Polynomial Algorithm for the Inference of Context Free Languages

  • Authors:
  • Alexander Clark;Rémi Eyraud;Amaury Habrard

  • Affiliations:
  • Department of Computer Science, Royal Holloway, University of London,;Laboratoire d'Informatique Fondamentale, University of Aix-Marseille, CNRS,;Laboratoire d'Informatique Fondamentale, University of Aix-Marseille, CNRS,

  • Venue:
  • ICGI '08 Proceedings of the 9th international colloquium on Grammatical Inference: Algorithms and Applications
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a polynomial algorithm for the inductive inference of a large class of context free languages, that includes all regular languages. The algorithm uses a representation which we call Binary Feature Grammars based on a set of features, capable of representing richly structured context free languages as well as some context sensitive languages. More precisely, we focus on a particular case of this representation where the features correspond to contexts appearing in the language. Using the paradigm of positive data and a membership oracle, we can establish that all context free languages that satisfy two constraints on the context distributions can be identified in the limit by this approach. The polynomial time algorithm we propose is based on a generalisation of distributional learning and uses the lattice of context occurrences. The formalism and the algorithm seem well suited to natural language and in particular to the modelling of first language acquisition.