Discriminative classifiers with adaptive kernels for noise robust speech recognition

Authors:
M. J. F. Gales;F. Flego
Affiliations:
Cambridge University Engineering Department, Trumpington Street, Cambridge CB2 1PZ, United Kingdom;Cambridge University Engineering Department, Trumpington Street, Cambridge CB2 1PZ, United Kingdom
Venue:
Computer Speech and Language
Year:
2010

Citing 6
Cited 1

Exploiting generative models in discriminative classifiers

Proceedings of the 1998 conference on Advances in neural information processing systems II
Acoustical and Environmental Robustness in Automatic Speech Recognition

Acoustical and Environmental Robustness in Automatic Speech Recognition
A Tutorial on Support Vector Machines for Pattern Recognition

Data Mining and Knowledge Discovery
Speech recognition in noisy environments

Speech recognition in noisy environments
Probability Estimates for Multi-class Classification by Pairwise Coupling

The Journal of Machine Learning Research
Maximum entropy direct models for speech recognition

IEEE Transactions on Audio, Speech, and Language Processing

Importance sampling to compute likelihoods of noise-corrupted speech

Computer Speech and Language

Quantified Score

Hi-index	0.00

Visualization

Abstract

Discriminative classifiers are a popular approach to solving classification problems. However, one of the problems with these approaches, in particular kernel based classifiers such as support vector machines (SVMs), is that they are hard to adapt to mismatches between the training and test data. This paper describes a scheme for overcoming this problem for speech recognition in noise by adapting the kernel rather than the SVM decision boundary. Generative kernels, defined using generative models, are one type of kernel that allows SVMs to handle sequence data. By compensating the parameters of the generative models for each noise condition noise-specific generative kernels can be obtained. These can be used to train a noise-independent SVM on a range of noise conditions, which can then be used with a test-set noise kernel for classification. The noise-specific kernels used in this paper are based on Vector Taylor Series (VTS) model-based compensation. VTS allows all the model parameters to be compensated and the background noise to be estimated in a maximum likelihood fashion. A brief discussion of VTS, and the optimisation of the mismatch function representing the impact of noise on the clean speech, is also included. Experiments using these VTS-based test-set noise kernels were run on the AURORA 2 continuous digit task. The proposed SVM rescoring scheme yields large gains in performance over the VTS compensated models.