Speech enhancement using harmonic emphasis and adaptive comb filtering

Authors:
Wen Jin;Xin Liu;Michael S. Scordilis;Lu Han
Affiliations:
Qualcomm, San Diego, CA;Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL;Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL;Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, NC
Venue:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2010

Citing 5
Cited 1

HMM-Based Speech Enhancement Using Harmonic Modeling

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97)-Volume 2 - Volume 2
Audio signal noise reduction using multi-resolution sinusoidal modeling

ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 02
Speech enhancement using a masking threshold constrained Kalman filter and its heuristic implementations

IEEE Transactions on Audio, Speech, and Language Processing
Generalized Postfilter for Speech Quality Enhancement

IEEE Transactions on Audio, Speech, and Language Processing
Evaluation of Objective Quality Measures for Speech Enhancement

IEEE Transactions on Audio, Speech, and Language Processing

Reduction of musical residual noise using block-and-directional-median filter adapted by harmonic properties

Speech Communication

Quantified Score

Hi-index	0.00

Visualization

Abstract

An enhancement method for single-channel speech degraded by additive noise is proposed. A spectral weighting function is derived by constrained optimization to suppress noise in the frequency domain. Two design parameters are included in the suppression gain, namely, the frequency-dependent noise-flooring parameter (FDNFP) and the gain factor. The FDNFP controls the level of admissible residual noise in the enhanced speech. Enhanced harmonic structures are incorporated into the FDNFP by time-domain processing of the linear prediction residuals of voiced speech. Further enhancement of the harmonics is achieved by adaptive comb filtering derived using the gain factor with a peak-picking algorithm. The performance of the enhancement method was evaluated by the modified bark spectral distance (MBSD), ITU-Perceptual Evaluation of Speech Quality (PESQ) scores, composite objective measures and listening tests. Experimental results indicate that the proposed method outperforms spectral subtraction; a main signal subspace method applicable to both white and colored noise conditions and a perceptually based enhancement method with a constant noise-flooring parameter, particularly at lower signal-to-noise ratio conditions. Our listening test indicated that 16 listeners on average preferred the proposed approach over any of the other three approaches about 73% of the time.