Blind Model Selection for Automatic Speech Recognition in Reverberant Environments

  • Authors:
  • Laurent Couvreur;Christophe Couvreur

  • Affiliations:
  • Multitel—TCTS, Faculté Polytechnique de Mons, 1 Avenue Copernic, B-7000 Mons, Belgium;Speech & Language Technology Division, Scansoft, Inc., 32 Guldensporenpark, B-9820 Merelbeke, Belgium

  • Venue:
  • Journal of VLSI Signal Processing Systems
  • Year:
  • 2004

Abstract

This communication presents a new method for automatic speech recognition in reverberant environments. Our approach consists of selecting the best acoustic model from a library of models, each trained on an artificially reverberated speech database corresponding to a different reverberant condition. Given a speech utterance recorded in a reverberant room, a maximum-likelihood estimate of the fullband room reverberation time is computed using a statistical model for short-term log-energy sequences of anechoic speech. The estimated reverberation time is then used to select the best acoustic model, i.e., the model trained on the speech database whose reverberation time most closely matches the estimate, and that model is used to recognize the reverberated utterance. The proposed model selection approach is shown to significantly improve recognition accuracy on a connected-digit task in both simulated and real reverberant environments, outperforming standard channel normalization techniques.
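
The selection step described in the abstract can be illustrated with a minimal Python sketch. It assumes the reverberation time has already been estimated and covers only the nearest-match lookup, not the maximum-likelihood estimator itself. The names `AcousticModel`, `select_model`, and `training_t60`, and the reverberation times in the example library, are hypothetical and do not come from the paper.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class AcousticModel:
    """Hypothetical handle for one acoustic model in the library."""
    name: str
    training_t60: float  # reverberation time (s) of the training database (assumed values)


def select_model(library: Sequence[AcousticModel], estimated_t60: float) -> AcousticModel:
    """Return the model whose training reverberation time is closest to the estimate."""
    if not library:
        raise ValueError("model library is empty")
    return min(library, key=lambda m: abs(m.training_t60 - estimated_t60))


# Illustrative library; these reverberation times are assumptions, not the paper's.
library = [
    AcousticModel("anechoic", 0.0),
    AcousticModel("t60_0.4s", 0.4),
    AcousticModel("t60_0.8s", 0.8),
    AcousticModel("t60_1.2s", 1.2),
]

# Suppose the estimator returned 0.55 s for the current utterance.
chosen = select_model(library, estimated_t60=0.55)
print(chosen.name)
```

In this sketch the lookup is a plain nearest-neighbour search on a single scalar, so its cost grows only with the number of models in the library; the chosen model would then be handed to the recognizer for decoding.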