Over-attenuated components regeneration for speech enhancement

Authors:
Huijun Ding;Ing Yann Soon;Chai Kiat Yeo
Affiliations:
School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore;School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore;School of Computer Engineering, Nanyang Technological University, Singapore
Venue:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2010

Citing 4
Cited 0

A spectral filtering method based on hybrid wiener filters for speech enhancement

Speech Communication
A post-processing technique for regeneration of over-attenuated speech

ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Improved Signal-to-Noise Ratio Estimation for Speech Enhancement

IEEE Transactions on Audio, Speech, and Language Processing
Evaluation of Objective Quality Measures for Speech Enhancement

IEEE Transactions on Audio, Speech, and Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Despite the quality improvement of the speech signal with most traditional noise reduction (TNR) algorithms, the output is always distorted to some extent due to the over-attenuation of speech components. Weak speech components are usually regarded as noise in noise reduction processing and are therefore highly suppressed. In this paper, we propose a postprocessing technique which is based on the regeneration of both the voiced and unvoiced speech in the entire frequency domain to reduce this problem. A nonlinear transform is first applied to obtain the excitation signal, and a smooth envelope is then estimated. To utilize the information of the clean speech contained in the envelope, we combine the original TNR filter output with a weighted product of the excitation signal and the estimated envelope to generate the final synthesized speech. The synthesized speech is quite close to the clean speech and is more natural-sounding. Moreover, our algorithm can mask the residual musical noise effectively with the regenerated speech components. Experimental results demonstrate the excellent performance of our algorithm. In addition, we introduce two novel objective measures and further show the efficiency of our algorithm in maintaining the clean speech while reducing the noise as much as possible.