Improving GMM-UBM speaker verification using discriminative feedback adaptation

Authors:
Yi-Hsiang Chao;Wei-Ho Tsai;Hsin-Min Wang
Affiliations:
Department of Applied Geomatics, Ching Yun University, Taoyuan, Taiwan;Department of Electronic Engineering, National Taipei University of Technology, Taipei, Taiwan;Institute of Information Science, Academia Sinica, Taipei, Taiwan
Venue:
Computer Speech and Language
Year:
2009

Citing 2
Cited 0

Speaker identification and verification using Gaussian mixture speaker models

Speech Communication
Spoken Language Processing: A Guide to Theory, Algorithm, and System Development

Spoken Language Processing: A Guide to Theory, Algorithm, and System Development

Quantified Score

Hi-index	0.00

Visualization

Abstract

The Gaussian mixture model - Universal background model (GMM-UBM) system is one of the predominant approaches for text-independent speaker verification, because both the target speaker model and the impostor model (UBM) have generalization ability to handle ''unseen'' acoustic patterns. However, since GMM-UBM uses a common anti-model, namely UBM, for all target speakers, it tends to be weak in rejecting impostors' voices that are similar to the target speaker's voice. To overcome this limitation, we propose a discriminative feedback adaptation (DFA) framework that reinforces the discriminability between the target speaker model and the anti-model, while preserving the generalization ability of the GMM-UBM approach. This is achieved by adapting the UBM to a target speaker dependent anti-model based on a minimum verification squared-error criterion, rather than estimating the model from scratch by applying the conventional discriminative training schemes. The results of experiments conducted on the NIST2001-SRE database show that DFA substantially improves the performance of the conventional GMM-UBM approach.