Multichannel Cepstral Domain Feature Warping for Robust Speech Recognition

  • Authors:
  • Stefano Squartini;Marco Fagiani;Emanuele Principi;Francesco Piazza

  • Affiliations:
  • A3LAB, Department of Biomedics, Electronics and Telecommunications, Università Politecnica delle Marche, Italy;A3LAB, Department of Biomedics, Electronics and Telecommunications, Università Politecnica delle Marche, Italy;A3LAB, Department of Biomedics, Electronics and Telecommunications, Università Politecnica delle Marche, Italy;A3LAB, Department of Biomedics, Electronics and Telecommunications, Università Politecnica delle Marche, Italy

  • Venue:
  • Proceedings of the 2011 conference on Neural Nets WIRN10: Proceedings of the 20th Italian Workshop on Neural Nets
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Speech interfaced Human-Machine interfaces have been gaining an increasing interest among the scientific community and technology market. One of the key task to be faced within this architectures is speech recognition, for which a certain degree of understanding has been already reached in the literature. Several efforts have been oriented on purpose to take the presence of acoustic non-idealities, i.e. noise and acoustic reverberation, into account. Feature warping in cepstral domain is one of the most performing approaches: different techniques have been proposed so far and some of them are based on the histogram-equalization concept, which has shown to be effective. It employs a suitable transformation to the acoustically degraded cepstral coefficients in order to re-establish the clean speech statistical properties. The Quantile-based implementation has been taken here as reference. The main objective of this work is exploiting the availability of multichannel acoustic information, coming from a microphone array acquisition, in order to enhance the statistics modeling capabilities of the algorithm and therefore improve its speech enhancement effect. Some computer simulations based on the AURORA 2 database have been carried out and obtained results confirm the effectiveness of the idea.