Environmental robust speech and speaker recognition through multi-channel histogram equalization

  • Authors:
  • Stefano Squartini;Emanuele Principi;Rudy Rotili;Francesco Piazza

  • Affiliations:
  • 3MediaLabs, Department of Information Engineering, Università Politecnica delle Marche, Via Brecce Bianche 1, 60131, Ancona, Italy;3MediaLabs, Department of Information Engineering, Università Politecnica delle Marche, Via Brecce Bianche 1, 60131, Ancona, Italy;3MediaLabs, Department of Information Engineering, Università Politecnica delle Marche, Via Brecce Bianche 1, 60131, Ancona, Italy;3MediaLabs, Department of Information Engineering, Università Politecnica delle Marche, Via Brecce Bianche 1, 60131, Ancona, Italy

  • Venue:
  • Neurocomputing
  • Year:
  • 2012

Abstract

Feature statistics normalization in the cepstral domain is one of the best-performing approaches for robust automatic speech and speaker recognition in noisy acoustic scenarios: feature coefficients are normalized using suitable linear or nonlinear transformations so that the statistics of the noisy speech match those of clean speech. Histogram equalization (HEQ) belongs to this category of algorithms, has proved effective for this purpose, and is therefore taken here as the reference. In this paper, the presence of multiple acoustic channels is used to enhance the statistics modeling capabilities of the HEQ algorithm by exploiting the availability of multiple noisy speech occurrences, with the aim of maximizing the effectiveness of the cepstra normalization process. Computer simulations based on the Aurora 2 database in speech and speaker recognition scenarios show that a significant recognition improvement can be achieved with respect to the single-channel counterpart and other multi-channel techniques, confirming the effectiveness of the idea. The proposed algorithmic configuration has also been combined with the kernel estimation technique to further improve speech recognition performance.
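As an illustration of the normalization idea described in the abstract, the following is a minimal sketch of histogram equalization applied to cepstral features. It is not the authors' implementation: the function names `heq` and `multichannel_heq`, the standard-normal reference distribution, the rank-based empirical CDF, and the simple pooling of frames across channels are all assumptions made here for illustration. The abstract does not specify how the multi-channel statistics are actually combined.

```python
import numpy as np
from statistics import NormalDist


def heq(features, stats_pool=None):
    """Equalize each cepstral coefficient to a standard-normal reference.

    features:   (n_frames, n_coeffs) array to normalize.
    stats_pool: (n_samples, n_coeffs) array used to estimate the empirical
                CDF; defaults to `features` itself (single-channel case).
    The Gaussian reference is an assumption of this sketch.
    """
    features = np.asarray(features, dtype=float)
    pool = features if stats_pool is None else np.asarray(stats_pool, dtype=float)
    inv_cdf = np.vectorize(NormalDist().inv_cdf)
    n = pool.shape[0]
    out = np.empty_like(features)
    for k in range(features.shape[1]):
        sorted_pool = np.sort(pool[:, k])
        # Rank of each frame within the pooled samples of coefficient k.
        rank = np.searchsorted(sorted_pool, features[:, k], side="right")
        # Empirical CDF, kept strictly inside (0, 1) so the inverse is finite.
        cdf = np.clip((rank - 0.5) / n, 0.5 / n, 1.0 - 0.5 / n)
        out[:, k] = inv_cdf(cdf)
    return out


def multichannel_heq(channels, target=0):
    """Hypothetical multi-channel variant: estimate the CDF from the frames
    of all channels pooled together, then normalize the target channel."""
    pooled = np.concatenate([np.asarray(c, dtype=float) for c in channels], axis=0)
    return heq(channels[target], stats_pool=pooled)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = rng.normal(size=(200, 13))
    # Three synthetic "microphones": same speech, different additive noise.
    channels = [2.0 * clean + 1.0 + 0.3 * rng.normal(size=clean.shape) for _ in range(3)]
    single = heq(channels[0])
    multi = multichannel_heq(channels)
    print(single.shape, multi.shape)
```

Pooling gives the CDF estimate more samples per coefficient than a single channel provides, which is one plausible reading of "exploiting the availability of multiple noisy speech occurrences"; other combination rules would fit the same interface.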