Joint MAP adaptation of feature transformation and Gaussian Mixture Model for speaker recognition

Authors:
Donglai Zhu;Bin Ma;Haizhou Li
Affiliations:
Institute for Infocomm Research, A*Star, 1 Fusionopolis Way, Singapore 138632;Institute for Infocomm Research, A*Star, 1 Fusionopolis Way, Singapore 138632;Institute for Infocomm Research, A*Star, 1 Fusionopolis Way, Singapore 138632
Venue:
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Year:
2009

Citing 0
Cited 2

An overview of text-independent speaker recognition: From features to supervectors
Speech Communication

Variational conditional random fields for online speaker detection and tracking
Speech Communication

Abstract

This paper extends our previous work on feature transformation-based support vector machines for speaker recognition by proposing joint MAP adaptation of feature transformation (FT) and Gaussian mixture model (GMM) parameters. In the new approach, the prior probability density functions (PDFs) of the FT and GMM parameters are jointly estimated from background data under the maximum likelihood criterion. This yields a generic prior GMM that is more compact than the Universal Background Model because speaker variation is reduced. Using the prior PDFs, we construct a supervector from the FT and GMM parameters to characterize each speaker. Experiments on the NIST 2006 Speaker Recognition Evaluation (SRE06) data set validate the effectiveness of the joint MAP adaptation approach.
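
The abstract does not give the adaptation equations, so the following is only a minimal sketch of the conventional building block it refers to: mean-only relevance-MAP adaptation of a diagonal-covariance prior GMM, followed by stacking the adapted means into a supervector. It does not implement the paper's joint FT and GMM adaptation. All function names, the toy data, and the relevance factor of 16 are assumptions made for illustration.

```python
import math


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at point x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def posteriors(x, weights, means, variances):
    """Component posteriors P(k | x) under the prior GMM."""
    logs = [
        math.log(w) + log_gauss_diag(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    top = max(logs)  # subtract the max for numerical stability
    exps = [math.exp(value - top) for value in logs]
    total = sum(exps)
    return [e / total for e in exps]


def map_adapt_means(frames, weights, means, variances, relevance=16.0):
    """Mean-only relevance-MAP adaptation (hypothetical helper).

    The relevance factor is an assumed value, not taken from the paper.
    """
    n_comp, dim = len(means), len(means[0])
    counts = [0.0] * n_comp                      # soft counts per component
    sums = [[0.0] * dim for _ in range(n_comp)]  # posterior-weighted sums
    for x in frames:
        for k, p in enumerate(posteriors(x, weights, means, variances)):
            counts[k] += p
            for d in range(dim):
                sums[k][d] += p * x[d]
    # Interpolate between the data mean and the prior mean:
    # (sum + r * prior) / (count + r), which equals
    # alpha * data_mean + (1 - alpha) * prior with alpha = count / (count + r).
    return [
        [
            (sums[k][d] + relevance * means[k][d]) / (counts[k] + relevance)
            for d in range(dim)
        ]
        for k in range(n_comp)
    ]


def supervector(adapted_means):
    """Concatenate the adapted component means into one flat vector."""
    return [value for mean in adapted_means for value in mean]


# Toy prior GMM: two 1-D components (illustrative values only).
prior_weights = [0.5, 0.5]
prior_means = [[0.0], [10.0]]
prior_vars = [[1.0], [1.0]]

# Sixteen identical frames near the first component.
speaker_frames = [[1.0]] * 16

adapted = map_adapt_means(speaker_frames, prior_weights, prior_means, prior_vars)
speaker_sv = supervector(adapted)
```

In this toy case the first component absorbs essentially all sixteen frames, so its mean moves halfway from the prior (0.0) toward the data (1.0), while the second component stays at its prior mean. In a supervector-based system, a vector such as `speaker_sv` would then be passed to a classifier such as an SVM.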