Wavelet-based speech enhancement using time-frequency adaptation

Authors:
Kun-Ching Wang
Affiliations:
Department of Information Technology & Communication, Shin Chien University, Kaohsiung, Taiwan
Venue:
EURASIP Journal on Advances in Signal Processing
Year:
2009

Citing 10
Cited 0

Assessment for automatic speech recognition II: NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition systems

Speech Communication - Special issue on speech processing in adverse conditions
Adaptive filter theory (3rd ed.)

Adaptive filter theory (3rd ed.)
Discrete Time Processing of Speech Signals

Discrete Time Processing of Speech Signals
Speech Enhancement Using Perceptual Wavelet Packet Decomposition and Teager Energy Operator

Journal of VLSI Signal Processing Systems
Psychoacoustics: Facts and Models

Psychoacoustics: Facts and Models
Wavelet speech enhancement based on time-scale adaptation

Speech Communication
The Teager energy based feature parameters for robust speech recognition in car noise

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 01
Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs

ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 200. on IEEE International Conference - Volume 02
Perceptual speech coding and enhancement using frame-synchronizedfast wavelet packet transform algorithms

IEEE Transactions on Signal Processing
De-noising by soft-thresholding

IEEE Transactions on Information Theory

Quantified Score

Hi-index	0.00

Visualization

Abstract

Wavelet denoising is commonly used for speech enhancement because of the simplicity of its implementation. However, the conventional methods generate the presence of musical residual noise while thresholding the background noise. The unvoiced components of speech are often eliminated from this method. In this paper, a novel algorithm of wavelet coefficient threshold (WCT) based on time-frequency adaptation is proposed. In addition, an unvoiced speech enhancement algorithm is also integrated into the system to improve the intelligibility of speech. The wavelet coefficient threshold (WCT) of each subband is first temporally adjusted according to the value of a posterior signal-to-noise ratio (SNR). To prevent the degradation of unvoiced sounds during noise, the algorithm utilizes a simple speech/noise detector (SND) and further divides speech signal into unvoiced and voiced sounds. Then, we apply appropriate wavelet thresholding according to voiced/unvoiced (V/U) decision. Based on the masking properties of human auditory system, a perceptual gain factor is adopted into wavelet thresholding for suppressing musical residual noise. Simulation results show that the proposed method is capable of reducing noise with little speech degradation and the overall performance is superior to several competitive methods.