α-Gaussian mixture modelling for speaker recognition

  • Authors:
  • Dalei Wu;Ji Li;Haiqing Wu

  • Affiliations:
  • Department of Computer Science and Engineering, York University, 4700 Keele Street, Toronto, Ontario, Canada M3J 1P3;Department of Computer Science and Engineering, York University, 4700 Keele Street, Toronto, Ontario, Canada M3J 1P3;Department of Computer Science and Engineering, York University, 4700 Keele Street, Toronto, Ontario, Canada M3J 1P3

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2009

Quantified Score

Hi-index 0.10

Visualization

Abstract

Gaussian mixture model is the conventional approach employed in speaker recognition tasks. Although it is efficient to model specific speaking characteristics of a speaker, especially in quiet environments, its performance in noisy conditions is still far from the human cognitive process. Recently, a new method of @a-integration of stochastic models has been proposed based on psychophysical experiments that suggests @a-integration is used in a human brain. In this paper, we proposed a method to extend the conventional GMM to the @a-integrated GMM (@a-GMM) to model personal speaking traits. Model parameters were re-estimated recursively based on a given data set. The experiments showed that the new approach significantly outperforms the traditional method, especially on telephony speech.