Subspace construction and selection for speaker recognition

Authors:
Yanhua Long;Wu Guo;Bin Ma;Eng Siong Chng;Donglai Zhu;Lirong Dai;Haizhou Li
Affiliations:
iFly Speech Lab, EEIS, University of Science and Technology of China, China;iFly Speech Lab, EEIS, University of Science and Technology of China, China and MOE, Microsoft Key Laboratory, Multimedia Computing and Communication, USTC, China;Institute for Infocomm Research, Singapore;Nanyang Technology University, Singapore;Institute for Infocomm Research, Singapore;iFly Speech Lab, EEIS, University of Science and Technology of China, China;Institute for Infocomm Research, Singapore and Nanyang Technology University, Singapore
Venue:
ICICS'09 Proceedings of the 7th international conference on Information, communications and signal processing
Year:
2009

Citing 2
Cited 0

SVMTorch: support vector machines for large-scale regression problems

The Journal of Machine Learning Research
Joint Factor Analysis Versus Eigenchannels in Speaker Recognition

IEEE Transactions on Audio, Speech, and Language Processing

Quantified Score

Hi-index	0.01

Visualization

Abstract

In this paper, we propose a subspace construction and selection strategy (SUBS) for speaker recognition with limited training and testing speech data. Based on the individual Gaussian distributions of Gaussian mixture model (GMM), each speaker's characteristic subspace is constructed by training an SVM using the corresponding Gaussian mean vectors from the GMMs of both enrollment and imposter speakers. A subspace selection based on the structure risk criterion is used to select those subspaces with lower structure risks. The selected subspaces are then combined and used to evaluate the test utterances. We evaluate this subspace strategy on the 10sec-10sec test condition in 2008 NIST Speaker Recognition Evaluations, achieving a relative 12.16% equal error rate reduction over the GMM Supervector baseline system.