Histogram equalization to model adaptation for robust speech recognition

Authors:
Youngjoo Suh;Hoirin Kim
Affiliations:
Korea Advanced Institute of Science and Technology, Daejeon, South Korea;Korea Advanced Institute of Science and Technology, Daejeon, South Korea
Venue:
EURASIP Journal on Advances in Signal Processing
Year:
2010

Citing 7
Cited 0

Cepstral parameter compensation for HMM recognition in noise

Speech Communication - Special issue on speech processing in adverse conditions
Cepstral domain segmental feature vector normalization for noise robust speech recognition

Speech Communication - Special issue on robust speech recognition
Digital Image Processing

Digital Image Processing
Spoken Language Processing: A Guide to Theory, Algorithm, and System Development

Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
A vector Taylor series approach for environment-independent speech recognition

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
Compensating acoustic mismatch using class-based histogram equalization for robust speech recognition

EURASIP Journal on Applied Signal Processing
The application of hidden Markov models in speech recognition

Foundations and Trends in Signal Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a new model adaptation method based on the histogram equalization technique for providing robustness in noisy environments. The trained acoustic mean models of a speech recognizer are adapted into environmentally matched conditions by using the histogram equalization algorithm on a single utterance basis. For more robust speech recognition in the heavily noisy conditions, trained acoustic covariance models are efficiently adapted by the signal-to-noise ratio-dependent linear interpolation between trained covariance models and utterance-level sample covariance models. Speech recognition experiments on both the digit-based Aurora2 task and the large vocabulary-based task showed that the proposed model adaptation approach provides significant performance improvements compared to the baseline speech recognizer trained on the clean speech data.