Robust integration for speech features

Authors:
Kuo-Chang Huang;Yau-Tarng Juang;Wen-Chieh Chang
Affiliations:
System Application Engineering Department, Uniband Electronic Corp., Hsinchu, Taiwan, ROC;Department of Electrical Engineering, National Central University, Chung-Li, Taiwan, ROC;Department of Electrical Engineering, National Central University, Chung-Li, Taiwan, ROC
Venue:
Signal Processing - Signal processing in UWB communications
Year:
2006

Citing 3
Cited 0

Acoustical and environmental robustness in automatic speech recognition

Acoustical and environmental robustness in automatic speech recognition
Assessment for automatic speech recognition II: NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition systems

Speech Communication - Special issue on speech processing in adverse conditions
A likelihood measure based on projection-based group delay scheme for Mandarin speech recognition in noise

Signal Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Deployment of speech recognizers sometimes involves ambient conditions that are not present during the training phase of the operation. This environment mismatch is one major source of performance degradation for speech recognizers. This paper aims at the feature integration that is robust to the ambient environment change. Our consideration focuses on the speech cepstrum and the group delay spectrum (GDS), derived from linear prediction coefficients. We present a new feature integration approach for robust speech recognition in adverse conditions. Based on the robustness of cepstrum under clean environment and the robustness of GDS under noisy environment, it is effective by suitably combining the cepstral coefficient and the GDS coefficient for noise-resistance of speech in different condition mismatch. The performance of the proposed method is experimentally evaluated in speaker-independent isolated-word recognition task using the hidden Markov model (HMM) under various noise conditions. Noisy speech is simulated by adding noise sources taken from the NOISEX-92 database. Experimental results obtained show that the new robust feature is effective for the speech recognition with significant noise and yields better performance than other feature coefficients. A substantial increase in recognition accuracy is observed in all testing noise environments at all different SNRs.