Speaker identification and verification using Gaussian mixture speaker models
Speech Communication
Robust bootstrapping of speaker models for unsupervised speaker indexing
MCAM'07 Proceedings of the 2007 international conference on Multimedia content analysis and mining
Hi-index | 0.00 |
Speaker modeling technique with sparse training data is an active branch of robust speaker recognition research This paper presents a novel modeling approach named Multi-EigenSpace modeling technique based on Regression Class (RC-MES), which integrates the common eigenspace technique and the regression class (RC) idea of Maximum Likelihood Linear Regression (MLLR) RC-MES not only solves the problem of prior knowledge limitation of Gaussian Mixture Models (GMM) but also remedies the shortcoming of common eigenspace that confuses speaker differences and phoneme differences The eigenvoice analysis in RC can provide better discrimination ability between different speakers The experimental results on speaker identification of 75 males show that, when enrolment data is sparse, RC-MES provides significant improvement over GMM, and the number of eigenvoices in RC-MES is fewer than that in common eigenspace.