This paper considers the suppression of late reverberation and additive noise in single-channel speech recordings. Reverberation introduces long-term correlation in the observed signal. In the first part of this work, we show how this correlation can be used to estimate the late reverberant spectral variance (LRSV) without assuming a specific model for the room impulse responses (RIRs) and without explicit estimates of RIR model parameters. This makes the correlation-based approach more robust against RIR modeling errors. However, the correlation-based method can follow only slow time variations in the RIRs. Existing model-based methods use statistical models for the RIRs that depend on one or more parameters that have to be estimated blindly. The common statistical models lead to simple expressions for the LRSV that depend on past values of the spectral variance of the reverberant, noise-free signal. All existing model-based LRSV estimators in the literature are derived assuming the RIRs to be time-invariant realizations of a stochastic process. In the second part of this paper, we go one step further and analyze time-varying RIRs. We show that in this case the reverberance tends to become decorrelated. We discuss the relations between different RIR models and their corresponding LRSV estimators. We show theoretically that simple estimators similar to those of the time-invariant case exist, provided that the reverberation time T60 and the direct-to-reverberation ratio (DRR) of the RIRs remain nearly constant over an interval on the order of a few frames. We show that the reverberation time can be taken to be independent of the frequency bin in DFT-based enhancement algorithms. Experiments with time-varying RIRs validate the analysis. Experiments with additive nonstationary noise and time-invariant RIRs show the influence of blind estimation of the reverberation time and the DRR.
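To illustrate what a "simple expression for the LRSV that depends on past values of the spectral variance" can look like, the following is a minimal sketch of the widely used exponential-decay (Polack-type) estimator for one frequency bin. It is not taken from this paper and may differ from the estimators it derives. The function name `late_reverb_variance` and its parameters are hypothetical, and the sketch assumes that T60 is known and that the spectral variance of the reverberant, noise-free signal is already available per frame.

```python
import math


def late_reverb_variance(spectral_variance, t60, hop_seconds, delay_frames):
    """Estimate the late reverberant spectral variance for one frequency bin.

    Assumes an exponentially decaying RIR energy envelope (an assumption of
    this sketch, not a result quoted from the paper).

    spectral_variance: per-frame spectral variance of the reverberant,
                       noise-free signal (list of floats)
    t60:               reverberation time in seconds
    hop_seconds:       frame shift in seconds
    delay_frames:      number of frames after which reverberation counts as late
    """
    # Amplitude decay rate; the energy decays by 60 dB over t60 seconds.
    decay_rate = 3.0 * math.log(10.0) / t60
    # Energy attenuation accumulated over the late-reverberation delay.
    attenuation = math.exp(-2.0 * decay_rate * delay_frames * hop_seconds)

    estimate = []
    for frame in range(len(spectral_variance)):
        past = frame - delay_frames
        # No past observation is available for the first few frames.
        estimate.append(attenuation * spectral_variance[past] if past >= 0 else 0.0)
    return estimate
```

With T60 = 0.6 s, a 10 ms frame shift, and a delay of 6 frames, the delay spans one tenth of T60, so the past spectral variance is scaled down by 6 dB. The sketch uses a single T60 for every bin, which matches the abstract's statement that the reverberation time can be taken to be independent of the frequency bin.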