A Study of Variable-Parameter Gaussian Mixture Hidden Markov Modeling for Noisy Speech Recognition

Authors:
Xiaodong Cui;Yifan Gong
Affiliations:
Dept. of Electr. Eng., California Univ., Los Angeles, CA;-
Venue:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2007

Citing 0
Cited 4

A novel framework and training algorithm for variable-parameter hidden Markov models

IEEE Transactions on Audio, Speech, and Language Processing
Fast Algorithm and Efficient Implementation of GMM-Based Pattern Classifiers

Journal of Signal Processing Systems
Stereo hidden Markov modeling for noise robust speech recognition

Computer Speech and Language
Handling signal variability with contextual markovian models

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

To improve recognition performance in noisy environments, multicondition training is usually applied in which speech signals corrupted by a variety of noise are used in acoustic model training. Published hidden Markov modeling of speech uses multiple Gaussian distributions to cover the spread of the speech distribution caused by noise, which distracts the modeling of speech event itself and possibly sacrifices the performance on clean speech. In this paper, we propose a novel approach which extends the conventional Gaussian mixture hidden Markov model (GMHMM) by modeling state emission parameters (mean and variance) as a polynomial function of a continuous environment-dependent variable. At the recognition time, a set of HMMs specific to the given value of the environment variable is instantiated and used for recognition. The maximum-likelihood (ML) estimation of the polynomial functions of the proposed variable-parameter GMHMM is given within the expectation-maximization (EM) framework. Experiments on the Aurora 2 database show significant improvements of the variable-parameter Gaussian mixture HMMs compared to the conventional GMHMMs