The integration of principal component analysis and cepstral mean subtraction in parallel model combination for robust speech recognition

Authors:
Hadi Veisi;Hossein Sameti
Affiliations:
Department of Computer Engineering, Sharif University of Technology, Tehran, Iran;Department of Computer Engineering, Sharif University of Technology, Tehran, Iran
Venue:
Digital Signal Processing
Year:
2011

Citing 18
Cited 2

Fundamentals of speech recognition

Fundamentals of speech recognition
Cepstral parameter compensation for HMM recognition in noise

Speech Communication - Special issue on speech processing in adverse conditions
Speech recognition in noisy environments using first-order vector Taylor series

Speech Communication
Assessing local noise level estimation methods: application to noise robust ASR

Speech Communication - Special issue on noise robust ASR
Robust automatic speech recognition with missing and unreliable acoustic data

Speech Communication
Spoken Language Processing: A Guide to Theory, Algorithm, and System Development

Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
Discrete Time Processing of Speech Signals

Discrete Time Processing of Speech Signals
Experiments in Speaker Normalisation and Adaptation for Large Vocabulary Speech Recognition

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Fast Speech Recognition Algorithm Under Noisy Environment Using Modified CMS-PMC and Improved IDMM+SQ

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Speech recognition in noisy environments

Speech recognition in noisy environments
Independent calculation of power parameters on PMC method

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
An improved noise compensation algorithm for speech recognition in noise

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
Improving environmental robustness in large vocabulary speech recognition

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 01
A segment-based C/sub 0/ adaptation scheme for PMC-based noisy Mandarin speech recognition

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
Improved parallel model combination techniques with split Gaussian mixtures for speech recognition under noisy conditions

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
PCA-PMC: a novel use of a priori knowledge for fast parallel model combination

ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 02
Feature compensation in the cepstral domain employing model combination

Speech Communication
An improved approach to the hidden Markov model decomposition of speech and noise

ICASSP'92 Proceedings of the 1992 IEEE international conference on Acoustics, speech and signal processing - Volume 1

Speech enhancement using hidden Markov models in Mel-frequency domain

Speech Communication
Autoregressive modeling of speech trajectory transformed to the reconstructed phase space for ASR purposes

Digital Signal Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper addresses the problem of automatic speech recognition in real applications in which the speech signal is altered by various noises. Feature compensation and model compensation robustness methods are studied. Parallel model combination (PMC) and its recent advances are reviewed and a novel algorithm called PC-PMC is proposed. This algorithm utilizes cepstral mean subtraction (CMS) normalization ability and principal component analysis (PCA) compression and de-correlation capability in the combination with PMC model transformation method. PC-PMC algorithm takes the advantages of additive noise compensation ability of PMC and convolutional noise removal capability of CMS and PCA. In realizing PC-PMC mainly two problems should be solved. The first problem is that PMC method requires invertible modules in the front-end of the system while CMS normalization is not an invertible process. Moreover, when the recognition system is exposed to noisy speech, the adaptation of the PCA transform is required; therefore a framework is to be designed for adaptation of the PCA transform in the presence of noise. The method presented in this paper provides solutions to these problems. Our evaluations are done on the four different real noisy tasks using Nevisa HMM-based, Persian continuous speech recognition system. Experimental results demonstrate significant reduction in the system word error rate using PC-PMC. In addition, the effects of covariance matrix compensation, dynamic features adaptation and the effect of gain parameter in the PMC method are also studied and experimented in the real acoustic conditions. Besides, we have investigated the using of maximum likelihood linear regression (MLLR) and maximum a posteriori (MAP) adaptation techniques in the combination with the introduced PC-PMC method. Finally, a comprehensive discussion on the capabilities of the studied robustness methods is presented.