Generative model-based speaker clustering via mixture of von Mises-Fisher distributions

Authors:
Hao Tang; Stephen M. Chu; Thomas S. Huang
Affiliations:
Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, 61801, USA; IBM T. J. Watson Research Center, Yorktown Heights, N.Y. 10598, USA; Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, 61801, USA
Venue:
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Year:
2009

Citing: 0
Cited by: 3

Locality preserving speaker clustering
ICME'09 Proceedings of the 2009 IEEE International Conference on Multimedia and Expo

Singing speaker clustering based on subspace learning in the GMM mean supervector space
Speech Communication

A unified framework for domain independent online speaker indexing in eigen-voice space using an index tree of reference models
International Journal of Speech Technology

Abstract

This paper proposes a generative model-based speaker clustering algorithm in the maximum a posteriori (MAP) adapted Gaussian mixture model (GMM) mean supervector space. The algorithm can be viewed as an extension of the standard expectation-maximization (EM) algorithm for fitting a mixture model to data: it iterates between two steps, a sample re-assignment step (E-step) and a model re-estimation step (M-step), until convergence. Because GMM mean supervectors exhibit directional scattering patterns, a mixture of von Mises-Fisher distributions is employed in the model re-estimation step. In the sample re-assignment step, four sample-to-mixture assignment strategies are used: soft, hard, stochastic, and deterministic annealing assignments. Experiments on the GALE Mandarin dataset show that using a mixture of von Mises-Fisher distributions as the underlying model yields significantly higher speaker clustering accuracies than using a mixture of Gaussian distributions. It is further shown that deterministic annealing assignment outperforms soft assignment, that soft assignment is comparable to stochastic assignment, and that both soft and stochastic assignments outperform hard assignment.
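
The two-step iteration described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that all mixture components share one fixed concentration parameter (called kappa here), so the von Mises-Fisher normalizing constant cancels in the posteriors and no Bessel functions are needed. The function name fit_movmf, its parameters, and the random initialization are hypothetical choices made for illustration. Only the soft, hard, and stochastic assignments are shown; deterministic annealing is omitted because the abstract does not specify its schedule.

```python
import numpy as np


def fit_movmf(X, k, kappa=20.0, assignment="soft", n_iter=50, seed=0):
    """Cluster rows of X with a mixture of von Mises-Fisher components.

    Assumption (not from the paper): every component shares the same fixed
    concentration `kappa`, so the vMF normalizer cancels in the posteriors.
    """
    rng = np.random.default_rng(seed)
    # vMF models directions only, so project samples onto the unit sphere.
    X = X / np.linalg.norm(X, axis=1, keepdims=True)
    n = X.shape[0]
    # Initialize mean directions from k distinct samples, uniform weights.
    mu = X[rng.choice(n, size=k, replace=False)].copy()
    pi = np.full(k, 1.0 / k)

    for _ in range(n_iter):
        # Sample re-assignment step: posterior of each component per sample,
        # computed in the log domain for numerical stability.
        logits = np.log(pi) + kappa * (X @ mu.T)
        logits -= logits.max(axis=1, keepdims=True)
        post = np.exp(logits)
        post /= post.sum(axis=1, keepdims=True)

        if assignment == "soft":
            resp = post                                  # fractional membership
        elif assignment == "hard":
            resp = np.eye(k)[post.argmax(axis=1)]        # most probable component
        elif assignment == "stochastic":
            draws = np.array([rng.choice(k, p=p) for p in post])
            resp = np.eye(k)[draws]                      # sampled component
        else:
            raise ValueError(f"unknown assignment: {assignment}")

        # Model re-estimation step: mixture weights and mean directions.
        pi = np.maximum(resp.mean(axis=0), 1e-12)        # avoid log(0) next pass
        pi /= pi.sum()
        r = resp.T @ X                                   # weighted resultant vectors
        norms = np.linalg.norm(r, axis=1, keepdims=True)
        # Keep the previous direction for a component that received no mass.
        mu = np.where(norms > 1e-12, r / np.maximum(norms, 1e-12), mu)

    # Labels come from the posteriors of the last re-assignment step.
    return post.argmax(axis=1), mu, pi
```

With a shared kappa, the hard assignment reduces to spherical k-means on cosine similarity. Estimating a separate concentration per component, as a full mixture of von Mises-Fisher distributions would, additionally requires evaluating or approximating ratios of modified Bessel functions.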