Learning languages with rational kernels

Authors:
Corinna Cortes;Leonid Kontorovich;Mehryar Mohri
Affiliations:
Google Research, New York, NY;Carnegie Mellon University, Pittsburgh, PA;Courant Institute of Mathematical Sciences, New York, NY and Google Research, New York, NY
Venue:
COLT'07 Proceedings of the 20th annual conference on Learning theory
Year:
2007

Citing 15
Cited 5

Semirings, automata, languages

Semirings, automata, languages
A training algorithm for optimal margin classifiers

COLT '92 Proceedings of the fifth annual workshop on Computational learning theory
Support-Vector Networks

Machine Learning
Generalization performance of support vector machines and other pattern classifiers

Advances in kernel methods
Automata, Languages, and Machines

Automata, Languages, and Machines
Automata: Theoretic Aspects of Formal Power Series

Automata: Theoretic Aspects of Formal Power Series
Piecewise testable events

Proceedings of the 2nd GI Conference on Automata Theory and Formal Languages
Finite-state transducers in language and speech processing

Computational Linguistics
Rational Kernels: Theory and Algorithms

The Journal of Machine Learning Research
Moment Kernels for Regular Distributions

Machine Learning
Introduction to probabilistic automata (Computer science and applied mathematics)

Introduction to probabilistic automata (Computer science and applied mathematics)
Kernel methods for learning languages

Theoretical Computer Science
Learning linearly separable languages

ALT'06 Proceedings of the 17th international conference on Algorithmic Learning Theory
Learning rational stochastic languages

COLT'06 Proceedings of the 19th annual conference on Learning Theory
Efficient computation of the relative entropy of probabilistic automata

LATIN'06 Proceedings of the 7th Latin American conference on Theoretical Informatics

Kernel methods for learning languages

Theoretical Computer Science
String Kernels Based on Variable-Length-Don't-Care Patterns

DS '08 Proceedings of the 11th International Conference on Discovery Science
On the learnability of shuffle ideals

ALT'12 Proceedings of the 23rd international conference on Algorithmic Learning Theory
Rational kernels for arabic text classification

SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing
On the learnability of shuffle ideals

The Journal of Machine Learning Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a general study of learning and linear separability with rational kernels, the sequence kernels commonly used in computational biology and natural language processing. We give a characterization of the class of all languages linearly separable with rational kernels and prove several properties of the class of languages linearly separable with a fixed rational kernel. In particular, we show that for kernels with transducer values in a finite set, these languages are necessarily finite Boolean combinations of preimages by a transducer of a single sequence. We also analyze the margin properties of linear separation with rational kernels and show that kernels with transducer values in a finite set guarantee a positive margin and lead to better learning guarantees. Creating a rational kernel with values in a finite set is often non-trivial even for relatively simple cases. However, we present a novel and general algorithm, double-tape disambiguation, that takes as input a transducer mapping sequences to sequence features, and yields an associated transducer that defines a finite range rational kernel. We describe the algorithm in detail and show its application to several cases of interest.