Bayesian Noise Compensation of Time Trajectories of Spectral Coefficients for Robust Speech Recognition

  • Authors:
  • Ilyas Potamitis;Nikos Fakotakis;George K. Kokkinakis

  • Venue:
  • TSD '01 Proceedings of the 4th International Conference on Text, Speech and Dialogue
  • Year:
  • 2001

Abstract

We present a novel data-driven compensation technique that modifies, on-line, the incoming spectral representation of degraded speech so that it approximates the features of the high-quality speech used to train a classifier. We apply the Bayesian inference framework to the degraded spectral coefficients, modeling the linear spectrum of clean speech with non-Gaussian distributions chosen so that the maximum a posteriori (MAP) estimate has a closed-form solution. The MAP solution leads to a soft-threshold function that is adapted to the spectral characteristics and noise variance of each spectral band. We evaluate the algorithm extensively against white and coloured Gaussian noise in the context of Automatic Speech Recognition (ASR) and demonstrate its robustness in adverse conditions. The enhancement process adds little to no computational overhead, so it runs in real time and on-line.
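
As a rough illustration of how a MAP estimate can reduce to a soft threshold, the sketch below assumes a zero-mean Laplacian prior on the clean coefficient and additive zero-mean Gaussian noise. The abstract only says "non-Gaussian distributions", so the Laplacian choice, the function names, and the parameters `noise_var` and `prior_std` are assumptions made for this example and are not taken from the paper. Under those assumptions the MAP estimate shrinks the observation toward zero by a threshold of sqrt(2) * noise_var / prior_std.

```python
import math


def map_soft_threshold(y, noise_var, prior_std):
    """MAP estimate of a clean coefficient from a noisy observation y.

    Assumptions (illustrative, not from the paper): the clean coefficient
    has a zero-mean Laplacian prior with standard deviation prior_std, and
    the noise is additive zero-mean Gaussian with variance noise_var.
    """
    # Threshold grows with the noise variance and shrinks as the prior widens.
    threshold = math.sqrt(2.0) * noise_var / prior_std
    # Shrink the magnitude toward zero; small observations are set to zero.
    magnitude = max(abs(y) - threshold, 0.0)
    # Restore the sign of the observation.
    return math.copysign(magnitude, y)


def compensate_trajectory(trajectory, noise_var, prior_std):
    """Apply the soft threshold to each sample of one band's time trajectory.

    noise_var and prior_std are per-band values (hypothetical inputs here),
    so each spectral band gets its own threshold.
    """
    return [map_soft_threshold(y, noise_var, prior_std) for y in trajectory]


if __name__ == "__main__":
    band = [3.0, -3.0, 0.2]
    print(compensate_trajectory(band, noise_var=1.0, prior_std=math.sqrt(2.0)))
```

Because the rule is a single comparison and subtraction per coefficient, a per-band threshold of this kind is cheap enough to apply frame by frame, which is consistent with the low overhead claimed in the abstract.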