Fundamentals of speech recognition
Fundamentals of speech recognition
Speech Communication - Special issue on noise robust ASR
Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
Acoustical and Environmental Robustness in Automatic Speech Recognition
Acoustical and Environmental Robustness in Automatic Speech Recognition
A Unified Compensation Approach for Speech Recognition in Severely Adverse Environment
ISUMA '03 Proceedings of the 4th International Symposium on Uncertainty Modelling and Analysis
On-line Stochastic Matching compensation for non-stationary noise
Computer Speech and Language
Computer Speech and Language
Joint evaluation of multiple speech patterns for speech recognition and training
Computer Speech and Language
Environmental Sniffing: Noise Knowledge Estimation for Robust Speech Systems
IEEE Transactions on Audio, Speech, and Language Processing
Hi-index | 0.00 |
Current automatic speech recognition (ASR) works in off-line mode and needs prior knowledge of the stationary or quasi-stationary test conditions for expected word recognition accuracy. These requirements limit the application of ASR for real-world applications where test conditions are highly non-stationary and are not known a priori. This paper presents an innovative frame dynamic rapid adaptation and noise compensation technique for tracking highly non-stationary noises and its application for on-line ASR. The proposed algorithm is based on a soft computing model using Bayesian on-line inference for spectral change point detection (BOSCPD) in unknown non-stationary noises. BOSCPD is tested with the MCRA noise tracking technique for on-line rapid environmental change learning in different non-stationary noise scenarios. The test results show that the proposed BOSCPD technique reduces the delay in spectral change point detection significantly compared to the baseline MCRA and its derivatives. The proposed BOSCPD soft computing model is tested for joint additive and channel distortions compensation (JAC)-based on-line ASR in unknown test conditions using non-stationary noisy speech samples from the Aurora 2 speech database. The simulation results for the on-line AR show significant improvement in recognition accuracy compared to the baseline Aurora 2 distributed speech recognition (DSR) in batch-mode.