Speech enhancement for automatic speech recognition using complex gaussian mixture priors for noise and speech

  • Authors:
  • Ramón F. Astudillo;Eugen Hoffmann;Philipp Mandelartz;Reinhold Orglmeister

  • Affiliations:
  • Department of Energy and Automation Technology, TU-Berlin, Berlin, Germany;Department of Energy and Automation Technology, TU-Berlin, Berlin, Germany;Department of Energy and Automation Technology, TU-Berlin, Berlin, Germany;Department of Energy and Automation Technology, TU-Berlin, Berlin, Germany

  • Venue:
  • NOLISP'09 Proceedings of the 2009 international conference on Advances in Nonlinear Speech Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Statistical speech enhancement methods often rely on a set of assumptions, like gaussianity of speech and noise processes or perfect knowledge of their parameters, that are not fully met in reality. Recent advancements have shown the potential improvement in speech enhancement obtained by employing supergaussian speech models conditioned on the estimated signal to noise ratio. In this paper we derive a supergaussian model for speech enhancement in which both speech and noise priors are assumed to be complex Gaussian mixture models. We introduce as well a method for the computation of the noise prior based on the noise variance estimator used. Finally, we compare the developed estimators with the conventional Ephraim-Malah filters in the context of robust automatic speech recognition.