Relevant Representations for the Inference of Rational Stochastic Tree Languages

  • Authors:
  • François Denis;Édouard Gilbert;Amaury Habrard;Faïssal Ouardi;Marc Tommasi

  • Affiliations:
  • Laboratoire d'Informatique Fondamentale, CNRS, Aix-Marseille Université,;Laboratoire d'Informatique Fondamentale de Lille (L.I.F.L.), INRIA and É.N.S. Cachan,;Laboratoire d'Informatique Fondamentale, CNRS, Aix-Marseille Université,;Laboratoire d'Informatique Fondamentale, CNRS, Aix-Marseille Université,;Laboratoire d'Informatique Fondamentale de Lille (L.I.F.L.), INRIA and É.N.S. Cachan,

  • Venue:
  • ICGI '08 Proceedings of the 9th international colloquium on Grammatical Inference: Algorithms and Applications
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Recently, an algorithm - DEES - was proposed for learning rational stochastic tree languages. Given a sample of trees independently and identically drawn according to a distribution defined by a rational stochastic language, DEES outputs a linear representation of a rational series which converges to the target. DEES can then be used to identify in the limit with probability one rational stochastic tree languages. However, when DEES deals with finite samples, it often outputs a rational tree series which does not define a stochastic language. Moreover, the linear representation can not be directly used as a generative model. In this paper, we show that any representation of a rational stochastic tree language can be transformed in a reduced normalised representation that can be used to generate trees from the underlying distribution. We also study some properties of consistency for rational stochastic tree languages and discuss their implication for the inference. We finally consider the applicability of DEES to trees built over an unranked alphabet.