Large-scale training of SVMs with automata kernels

  • Authors:
  • Cyril Allauzen;Corinna Cortes;Mehryar Mohri

  • Affiliations:
  • Google Research, New York, NY;Google Research, New York, NY;Courant Institute of Mathematical Sciences and Google Research, New York, NY

  • Venue:
  • CIAA'10 Proceedings of the 15th international conference on Implementation and application of automata
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a novel application of automata algorithms to machine learning. It introduces the first optimization solution for support vector machines used with sequence kernels that is purely based on weighted automata and transducer algorithms, without requiring any specific solver. The algorithms presented apply to a family of kernels covering all those commonly used in text and speech processing or computational biology. We show that these algorithms have significantly better computational complexity than previous ones and report the results of large-scale experiments demonstrating a dramatic reduction of the training time, typically by several orders of magnitude.