Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments

Authors:
D. Malah;R. V. Cox;A. J. Accardi
Affiliations:
AT&TBell Labs., Florham Park, NJ, USA;-;-
Venue:
ICASSP '99 Proceedings of the Acoustics, Speech, and Signal Processing, 1999. on 1999 IEEE International Conference - Volume 02
Year:
1999

Citing 0
Cited 16

Evaluation of an ERB frequency scale noise reduction for hearing aids: a comparative study

Speech Communication - Special issue on speech processing for hearing aids
Wavelet speech enhancement based on time-scale adaptation

Speech Communication
Uniform and warped low delay filter-banks for speech enhancement

Speech Communication
Speech enhancement by map spectral amplitude estimation using a super-Gaussian speech model

EURASIP Journal on Applied Signal Processing
A noise reduction preprocessor for mobile voice communication

EURASIP Journal on Applied Signal Processing
Speech enhancement by joint statistical characterization in the Log Gabor Wavelet domain

Speech Communication
Speech enhancement based on undecimated wavelet packet-perceptual filterbanks and MMSE--STSA estimation in various noise environments

Digital Signal Processing
Wavelet-Based Speech Enhancement Using Time-Adapted Noise Estimation

IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Incorporating the conditional speech presence probability in multi-channel Wiener filter based noise reduction in hearing aids

EURASIP Journal on Advances in Signal Processing - Special issue on digital signal processing for hearing instruments
Fast communication: Minima-controlled speech presence uncertainty tracking method for speech enhancement

Signal Processing
Use of speech presence uncertainty with MMSE spectral energy estimation for robust automatic speech recognition

Speech Communication
MMSE estimation of log-filterbank energies for robust speech recognition

Speech Communication
Generalized gamma distributed Bayesian estimator under speech presence probability

ACS'11 Proceedings of the 11th WSEAS international conference on Applied computer science
Non-negative matrix factorization based noise reduction for noise robust automatic speech recognition

LVA/ICA'12 Proceedings of the 10th international conference on Latent Variable Analysis and Signal Separation
Optimal speech enhancement under signal presence uncertainty using Log Gabor Wavelet and Bayesian Joint Statistics

International Journal of Speech Technology
Speech enhancement using generalized weighted β-order spectral amplitude estimator

Speech Communication

Quantified Score

Hi-index	0.00

Visualization

Abstract

Speech enhancement algorithms which are based on estimating the short-time spectral amplitude of the clean speech have better performance when a soft-decision gain modification, depending on the a priori probability of speech absence, is used. In reported works a fixed probability, q, is assumed. Since speech is non-stationary and may not be present in every frequency bin when voiced, we propose a method for estimating distinct values of q for different bins which are tracked in time. The estimation is based on a decision-theoretic approach for setting a threshold in each bin followed by short-time averaging. The estimated q's are used to control both the gain and the update of the estimated noise spectrum during speech presence in a modified MMSE log-spectral amplitude estimator. Subjective tests resulted in higher scores than for the IS-127 standard enhancement algorithm, when pre-processing noisy speech for a coding application.