Automatic speech recognition performance in different room acoustic environments with and without dereverberation preprocessing

  • Authors:
  • Alexandros Tsilfidis;Iosif Mporas;John Mourjopoulos;Nikos Fakotakis

  • Affiliations:
  • Wire Communications Laboratory, University of Patras, Greece;Wire Communications Laboratory, University of Patras, Greece;Wire Communications Laboratory, University of Patras, Greece;Wire Communications Laboratory, University of Patras, Greece

  • Venue:
  • Computer Speech and Language
  • Year:
  • 2013

Abstract

The performance of recent dereverberation methods for reverberant speech preprocessing prior to Automatic Speech Recognition (ASR) is compared for an extensive range of room and source-receiver configurations. It is shown that room acoustic parameters such as the clarity (C50) and the definition (D50) correlate well with the ASR results. When available, such room acoustic parameters can provide insight into reverberant speech ASR performance and potential improvement via dereverberation preprocessing. It is also shown that a recent dereverberation method based on perceptual modelling can be applied in this context to achieve significant Phone Recognition (PR) improvement, especially under highly reverberant conditions.
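
For readers unfamiliar with the two room acoustic parameters named in the abstract, the following is a minimal sketch of how C50 and D50 are commonly computed from a measured room impulse response, using the standard early-to-late and early-to-total energy ratios with a 50 ms boundary. It is not taken from the paper: the function name `clarity_and_definition`, the peak-based onset detection, and the synthetic decaying-noise impulse response are all assumptions made for illustration.

```python
import numpy as np

def clarity_and_definition(rir, fs, split_ms=50.0):
    """Return (C50 in dB, D50 as a ratio in [0, 1]) for an impulse response.

    Assumption: the direct sound is the strongest peak, so the 50 ms
    window is counted from the sample with the largest magnitude.
    """
    rir = np.asarray(rir, dtype=float)
    onset = int(np.argmax(np.abs(rir)))
    energy = rir[onset:] ** 2

    split = int(round(fs * split_ms / 1000.0))  # samples in the early window
    early = energy[:split].sum()
    late = energy[split:].sum()

    d50 = early / (early + late)
    # With no late energy the clarity is unbounded; report +inf explicitly.
    c50 = np.inf if late == 0 else 10.0 * np.log10(early / late)
    return c50, d50

# Hypothetical example: exponentially decaying noise with a 0.5 s
# reverberation time (60 dB amplitude decay), plus a dominant direct sound.
fs = 16000
rt60 = 0.5
t = np.arange(int(0.5 * fs)) / fs
rng = np.random.default_rng(0)
rir = rng.standard_normal(t.size) * np.exp(-np.log(1000.0) * t / rt60)
rir[0] = 10.0

c50, d50 = clarity_and_definition(rir, fs)
print(f"C50 = {c50:.2f} dB, D50 = {d50:.3f}")
```

The two quantities carry the same information, since C50 = 10 log10(D50 / (1 - D50)); higher values of either indicate that more of the energy arrives early, which is the condition under which reverberant speech is generally easier to recognise.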