Speech enhancement from noise: a regenerative approach
Speech Communication
Noisy speech enhancement using discrete cosine transform
Speech Communication
Rao-Blackwellised Particle Filtering for Dynamic Bayesian Networks
UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Speech enhancement by map spectral amplitude estimation using a super-Gaussian speech model
EURASIP Journal on Applied Signal Processing
Minimum Subspace Noise Tracking for noise Power Spectral Density estimation
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
A tutorial on particle filters for online nonlinear/non-GaussianBayesian tracking
IEEE Transactions on Signal Processing
Monte Carlo smoothing with application to audio signal enhancement
IEEE Transactions on Signal Processing
Evaluation of Objective Quality Measures for Speech Enhancement
IEEE Transactions on Audio, Speech, and Language Processing
Analysis of linear prediction, coding, and spectral estimation from subbands
IEEE Transactions on Information Theory
Particle filter enhancement of speech spectral amplitudes
IEEE Transactions on Audio, Speech, and Language Processing
International Journal of Speech Technology
Hi-index | 0.00 |
A discrete cosine transform (DCT) domain speech enhancement algorithm is proposed that models the evolution of speech DCT coefficients as a time-varying autoregressive process. Rao-Blackwellized particle filter (RBPF) techniques are used to estimate the model parameters and recover the clean signal coefficients. Using very low-order models for each coefficient and operating at a decimated frame rate, the proposed approach provides a significant complexity reduction compared to the standard full-band RBPF speech enhancement algorithm. In addition to the complexity gains, performance is also improved. Modeling the speech signal in the DCT-domain is shown to provide a better fit in spectral troughs, leading to more noise reduction and less speech distortion. To illustrate possible frequency-dependent processing strategies, a hybrid structure is proposed that offers a complexity/performance trade-off by substituting a simple DCT Wiener filter for the DCT-RBPF in some bands. In comparisons with high performing speech enhancement algorithms using wideband speech and noise, the proposed DCT-RBPF algorithm achieves higher scores on objective quality and intelligibility measures.