Correlation-based and model-based blind single-channel late-reverberation suppression in noisy time-varying acoustical environments

Authors:
Jan S. Erkelens;Richard Heusdens
Affiliations:
Department of Mediamatics, Delft University of Technology, Delft, The Netherlands;Department of Mediamatics, Delft University of Technology, Delft, The Netherlands
Venue:
IEEE Transactions on Audio, Speech, and Language Processing - Special issue on processing reverberant speech: methodologies and applications
Year:
2010

Abstract

This paper considers the suppression of late reverberation and additive noise in single-channel speech recordings. Reverberation introduces long-term correlation in the observed signal. In the first part of this work, we show how this correlation can be used to estimate the late reverberant spectral variance (LRSV) without assuming a specific model for the room impulse responses (RIRs) and without explicit estimates of RIR model parameters. This makes the correlation-based approach more robust against RIR modeling errors. However, the correlation-based method can follow only slow time variations in the RIRs. Existing model-based methods use statistical models for the RIRs that depend on one or more parameters, which have to be estimated blindly. The common statistical models lead to simple expressions for the LRSV that depend on past values of the spectral variance of the reverberant, noise-free signal. All existing model-based LRSV estimators in the literature are derived under the assumption that the RIRs are time-invariant realizations of a stochastic process. In the second part of this paper, we go one step further and analyze time-varying RIRs. We show that in this case the reverberance tends to become decorrelated. We discuss the relations between different RIR models and their corresponding LRSV estimators. We show theoretically that simple estimators similar to those of the time-invariant case exist, provided that the reverberation time T60 and the direct-to-reverberation ratio (DRR) of the RIRs remain nearly constant over an interval on the order of a few frames. We also show that the reverberation time can be taken to be independent of the frequency bin in DFT-based enhancement algorithms. Experiments with time-varying RIRs validate the analysis. Experiments with additive nonstationary noise and time-invariant RIRs show the influence of blind estimation of the reverberation time and the DRR.
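
To illustrate what a "simple expression for the LRSV that depends on past values of the spectral variance" can look like, the following minimal sketch assumes an exponentially decaying statistical RIR model of the kind commonly used in the dereverberation literature. It is not the estimator derived in this paper. The function name, the parameter names, and the choice of a 48 ms boundary between early and late reverberation are all assumptions made for the example. A single T60 is used for every frequency bin, in line with the abstract's remark that the reverberation time can be taken frequency-bin independent.

```python
import numpy as np


def late_reverb_variance(spec_var, t60, frame_shift, late_delay):
    """Sketch of a model-based LRSV estimate (hypothetical helper).

    Assumes an exponentially decaying RIR energy envelope, so that the late
    reverberant spectral variance is a delayed, attenuated copy of the
    reverberant spectral variance.

    spec_var    : array of shape (frames, bins), spectral variance of the
                  reverberant, noise-free signal
    t60         : reverberation time in seconds (same for all bins)
    frame_shift : hop between successive frames in seconds
    late_delay  : time after which reverberation counts as "late", in seconds
    """
    spec_var = np.asarray(spec_var, dtype=float)

    # Decay rate such that the energy drops by 60 dB after t60 seconds.
    decay = 3.0 * np.log(10.0) / t60

    # Delay expressed as a whole number of frames.
    n_late = int(round(late_delay / frame_shift))
    if n_late < 1:
        raise ValueError("late_delay must span at least one frame")

    # Energy attenuation accumulated over the delay.
    attenuation = np.exp(-2.0 * decay * n_late * frame_shift)

    # The first n_late frames have no past data, so their estimate is zero.
    lrsv = np.zeros_like(spec_var)
    lrsv[n_late:] = attenuation * spec_var[:-n_late]
    return lrsv


# Example: 10 frames, 4 bins, 16 ms hop, late reverberation after 48 ms.
example = late_reverb_variance(np.ones((10, 4)), t60=0.5,
                               frame_shift=0.016, late_delay=0.048)
```

In practice `t60` would itself have to be estimated blindly, and a fuller model would also include the DRR; both are left out of this sketch.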