Planar languages and learnability

  • Authors:
  • Alexander Clark;Christophe Costa Florêncio;Chris Watkins;Mariette Serayet

  • Affiliations:
  • Department of Computer Science, University of London, Egham, UK;Department of Computer Science, University of London, Egham, UK;Department of Computer Science, University of London, Egham, UK;Faculté des Sciences et Techniques, Département Informatique, Saint-Etienne, France

  • Venue:
  • ICGI'06 Proceedings of the 8th international conference on Grammatical Inference: algorithms and applications
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Strings can be mapped into Hilbert spaces using feature maps such as the Parikh map. Languages can then be defined as the pre-image of hyperplanes in the feature space, rather than using grammars or automata. These are the planar languages. In this paper we show that using techniques from kernel-based learning, we can represent and efficiently learn, from positive data alone, various linguistically interesting context-sensitive languages. In particular we show that the cross-serial dependencies in Swiss German, that established the non-context-freeness of natural language, are learnable using a standard kernel. We demonstrate the polynomial-time identifiability in the limit of these classes, and discuss some language theoretic properties of these classes, and their relationship to the choice of kernel/feature map.