Bayesian on-line spectral change point detection: a soft computing approach for on-line ASR

Authors:
M. F. Chowdhury;S. -A. Selouani;D. O'Shaughnessy
Affiliations:
INRS-EMT, Université du Québec, Montréal, Canada;Université de Moncton, Moncton, Canada;INRS-EMT, Université du Québec, Montréal, Canada
Venue:
International Journal of Speech Technology
Year:
2012

Citing 9
Cited 0

Fundamentals of speech recognition

Fundamentals of speech recognition
Compensation of channel and noise distortions combining normalization and speech enhancement techniques

Speech Communication - Special issue on noise robust ASR
Spoken Language Processing: A Guide to Theory, Algorithm, and System Development

Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
Acoustical and Environmental Robustness in Automatic Speech Recognition

Acoustical and Environmental Robustness in Automatic Speech Recognition
A Unified Compensation Approach for Speech Recognition in Severely Adverse Environment

ISUMA '03 Proceedings of the 4th International Symposium on Uncertainty Modelling and Analysis
On-line Stochastic Matching compensation for non-stationary noise

Computer Speech and Language
A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions

Computer Speech and Language
Joint evaluation of multiple speech patterns for speech recognition and training

Computer Speech and Language
Environmental Sniffing: Noise Knowledge Estimation for Robust Speech Systems

IEEE Transactions on Audio, Speech, and Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Current automatic speech recognition (ASR) works in off-line mode and needs prior knowledge of the stationary or quasi-stationary test conditions for expected word recognition accuracy. These requirements limit the application of ASR for real-world applications where test conditions are highly non-stationary and are not known a priori. This paper presents an innovative frame dynamic rapid adaptation and noise compensation technique for tracking highly non-stationary noises and its application for on-line ASR. The proposed algorithm is based on a soft computing model using Bayesian on-line inference for spectral change point detection (BOSCPD) in unknown non-stationary noises. BOSCPD is tested with the MCRA noise tracking technique for on-line rapid environmental change learning in different non-stationary noise scenarios. The test results show that the proposed BOSCPD technique reduces the delay in spectral change point detection significantly compared to the baseline MCRA and its derivatives. The proposed BOSCPD soft computing model is tested for joint additive and channel distortions compensation (JAC)-based on-line ASR in unknown test conditions using non-stationary noisy speech samples from the Aurora 2 speech database. The simulation results for the on-line AR show significant improvement in recognition accuracy compared to the baseline Aurora 2 distributed speech recognition (DSR) in batch-mode.