IEEE Transactions on Audio, Speech, and Language Processing - Special issue on processing reverberant speech: methodologies and applications
Model-based feature enhancement for reverberant speech recognition
A speech signal captured by a distant microphone is generally smeared by reverberation, which severely degrades automatic speech recognition (ASR) performance. One way to mitigate this problem is to dereverberate the observed signal before recognition. In this paper, a room impulse response is assumed to consist of three parts: the direct-path response, early reflections, and late reverberation. Because late reverberation is known to be a major cause of ASR performance degradation, this paper focuses on suppressing its effect. The proposed method first estimates the late reverberation using long-term multi-step linear prediction, and then reduces its effect by spectral subtraction. The algorithm achieved good dereverberation with training data equivalent to the duration of a single speech utterance, in our case less than 6 s. The framework is described for both single-channel and multichannel scenarios. Experimental results showed substantial improvements in ASR performance on real recordings under severely reverberant conditions.
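The single-channel pipeline described above can be sketched in a few lines of NumPy/SciPy: a delayed (multi-step) linear predictor is fit by least squares to estimate the late-reverberation component, whose magnitude spectrum is then subtracted from the observed signal in the STFT domain. This is only a minimal illustration of the idea; the prediction delay, filter order, STFT size, and spectral floor below are illustrative assumptions, not the paper's settings, and refinements such as pre-whitening are omitted.

```python
import numpy as np
from scipy.signal import stft, istft

def dereverberate(x, fs, delay_ms=30.0, order=300, floor=0.05):
    """Sketch of late-reverberation suppression via long-term multi-step
    linear prediction followed by spectral subtraction.

    Parameter values are illustrative assumptions, not the paper's.
    """
    N = len(x)
    D = int(fs * delay_ms / 1000)  # prediction delay: skip direct path and early reflections
    # Delayed data matrix: predict x[n] from x[n-D], ..., x[n-D-order+1]
    X = np.zeros((N, order))
    for k in range(order):
        X[D + k:, k] = x[:N - D - k]
    # Long-term multi-step linear prediction coefficients (least squares)
    w, *_ = np.linalg.lstsq(X, x, rcond=None)
    late = X @ w  # estimated late-reverberation component
    # Spectral subtraction in the STFT domain, reusing the observed phase
    _, _, Xs = stft(x, fs, nperseg=512)
    _, _, Rs = stft(late, fs, nperseg=512)
    mag = np.maximum(np.abs(Xs) - np.abs(Rs), floor * np.abs(Xs))
    _, y = istft(mag * np.exp(1j * np.angle(Xs)), fs, nperseg=512)
    return y[:N]
```

A multichannel variant would predict each channel from the delayed samples of all channels, but the single-channel case above conveys the structure: the predictor can only capture the long-term correlation introduced by the reverberation tail, so subtracting its output leaves the direct-path speech largely intact.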