Learning n-ary node selecting tree transducers from completely annotated examples

  • Authors:
  • A. Lemay;J. Niehren;R. Gilleron

  • Affiliations:
  • Mostrare project of INRIA Futurs, LIFL, University of Lille 3, Lille, France;Mostrare project of INRIA Futurs, LIFL, INRIA Futurs, Lille, France;Mostrare project of INRIA Futurs, LIFL, University of Lille 3, Lille, France

  • Venue:
  • ICGI'06 Proceedings of the 8th international conference on Grammatical Inference: algorithms and applications
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present the first algorithm for learning n-ary node selection queries in trees from completely annotated examples by methods of grammatical inference. We propose to represent n-ary queries by deterministic n-ary node selecting tree transducers (n-NSTTs). These are tree automata that capture the class of monadic second-order definable n-ary queries. We show that n-NSTTs defined polynomially bounded n-ary queries can be learned from polynomial time and data. An application in Web information extraction yields encouraging results.