A speech recognizer optimally combining learning vector quantization, dynamic programming and multi-layer perceptron

  • Authors:
  • Xavier Driancourt;Patrick Gallinari

  • Affiliations:
  • CNRS, Université Paris, Orsay Cedex, France;CNRS, Université Paris, Orsay Cedex, France

  • Venue:
  • ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1
  • Year:
  • 1992

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we give a detailed description of a new hybrid system for accoustic decoding. It uses a cooperation between a multi-layer perceptron (MLP) and an adaptive dynamic programming (DP) module. We show how to train the whole system in an optimal way using an adaptive gradient technique. The DP module optimizes cost functions inspired from k-means and learning vector quantization (L VQ). This module allows the training of synthetic references which incorporate discriminant information and improves the performances and/or speed of usual dynamic programming systems. We analyse and provide solutions to some problems which may occur when training the whole hybrid system and show that they are common to many modular architectures. These theoretical issues are illustrated through experiments on an isolated-word database.