Discriminative training of GMM for speaker identification

Authors:
C. M. del Alamo;F. J. Caminero Gil;C. de la Torre Munilla;L. Hernandez Gomez
Affiliations:
Telefonica Investigacion y Desarrollo, Madrid, Spain;-;-;-
Venue:
ICASSP '96: Proceedings of the 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing, Volume 1
Year:
1996

Abstract

We describe a novel discriminative training procedure for a Gaussian mixture model (GMM) speaker identification system. The proposal is based on the segmental generalized probabilistic descent (GPD) algorithm, formulated to estimate the GMM parameters. We propose two major innovations over similar formulations of segmental GPD training: (1) a misclassification measure based on an individual representation of each competing speaker, which explicitly allows different learning strategies for correctly and incorrectly classified speakers; and (2) an empirical loss function that controls the convergence of the training procedure, with a likelihood-based selection of correctly or incorrectly classified competing speakers. A comparison between the proposed method and the traditional GPD algorithm is also presented.
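
For orientation, the traditional GPD baseline mentioned in the abstract is commonly written with a single misclassification measure that pools all competing speakers, followed by a sigmoid loss. The sketch below illustrates that conventional formulation only; it is not the misclassification measure or empirical loss proposed in the paper, which the abstract does not spell out. The function names, the smoothing parameters `eta`, `gamma`, and `theta`, and the use of average per-frame log-likelihood as the discriminant score are assumptions made for illustration.

```python
import math


def gmm_log_likelihood(frames, weights, means, variances):
    """Average per-frame log-likelihood under a diagonal-covariance GMM."""
    total = 0.0
    for x in frames:
        comp_logs = []
        for w, mu, var in zip(weights, means, variances):
            log_p = math.log(w)
            for xd, md, vd in zip(x, mu, var):
                log_p += -0.5 * (math.log(2.0 * math.pi * vd) + (xd - md) ** 2 / vd)
            comp_logs.append(log_p)
        # Log-sum-exp over mixture components for numerical stability.
        m = max(comp_logs)
        total += m + math.log(sum(math.exp(c - m) for c in comp_logs))
    return total / len(frames)


def misclassification_measure(scores, true_idx, eta=1.0):
    """Conventional pooled measure (assumes eta > 0):
    d = -g_true + (1/eta) * log(mean over j != true of exp(eta * g_j)).
    d < 0 suggests a correct decision, d > 0 a misclassification.
    """
    competitors = [g for j, g in enumerate(scores) if j != true_idx]
    m = max(competitors)
    pooled = sum(math.exp(eta * (g - m)) for g in competitors) / len(competitors)
    return -scores[true_idx] + m + math.log(pooled) / eta


def sigmoid_loss(d, gamma=1.0, theta=0.0):
    """Smooth 0-1 loss applied to the misclassification measure."""
    return 1.0 / (1.0 + math.exp(-gamma * d + theta))


# Hypothetical per-speaker scores for one test segment (three enrolled speakers).
scores = [-10.0, -12.0, -15.0]
d_correct = misclassification_measure(scores, true_idx=0)
d_wrong = misclassification_measure(scores, true_idx=1)
```

In GPD training, the model parameters would then be moved a small step along the negative gradient of this loss for each training segment. Because the conventional measure collapses all competitors into one pooled term, it cannot treat individual competing speakers differently, which is the limitation the abstract's first innovation addresses.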