Subspace construction and selection for speaker recognition

  • Authors:
  • Yanhua Long;Wu Guo;Bin Ma;Eng Siong Chng;Donglai Zhu;Lirong Dai;Haizhou Li

  • Affiliations:
  • iFly Speech Lab, EEIS, University of Science and Technology of China, China;iFly Speech Lab, EEIS, University of Science and Technology of China, China and MOE, Microsoft Key Laboratory, Multimedia Computing and Communication, USTC, China;Institute for Infocomm Research, Singapore;Nanyang Technology University, Singapore;Institute for Infocomm Research, Singapore;iFly Speech Lab, EEIS, University of Science and Technology of China, China;Institute for Infocomm Research, Singapore and Nanyang Technology University, Singapore

  • Venue:
  • ICICS'09 Proceedings of the 7th international conference on Information, communications and signal processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper, we propose a subspace construction and selection strategy (SUBS) for speaker recognition with limited training and testing speech data. Based on the individual Gaussian distributions of Gaussian mixture model (GMM), each speaker's characteristic subspace is constructed by training an SVM using the corresponding Gaussian mean vectors from the GMMs of both enrollment and imposter speakers. A subspace selection based on the structure risk criterion is used to select those subspaces with lower structure risks. The selected subspaces are then combined and used to evaluate the test utterances. We evaluate this subspace strategy on the 10sec-10sec test condition in 2008 NIST Speaker Recognition Evaluations, achieving a relative 12.16% equal error rate reduction over the GMM Supervector baseline system.