Speaker Verification Based on Different Vector Quantization Techniques with Gaussian Mixture Models

Authors:
Sheeraz Memon;Margaret Lech;Namunu Maddage
Affiliations:
-;-;-
Venue:
NSS '09 Proceedings of the 2009 Third International Conference on Network and System Security
Year:
2009

Citing 0
Cited 2

Audio-Visual feature fusion for speaker identification

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part I
Speaker recognition utilizing distributed DCT-II based Mel frequency cepstral coefficients and fuzzy vector quantization

International Journal of Speech Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

The introduction of Gaussian Mixture Models (GMMs) in the field of speaker verification has led to very good results. This paper illustrates an evolution in state-of-the-art Speaker Verification by highlighting the contribution of recently established information theoretic based vector quantization technique. We explore the novel application of three different vector quantization algorithms, namely K-means, Linde-Buzo-Gray (LBG) and Information Theoretic Vector Quantization (ITVQ) for efficient speaker verification. The Expectation Maximization (EM) algorithm used by GMM requires a prohibitive amount of iterations to converge. In this paper, comparable alternatives to EM including K-means, LBG and ITVQ algorithm were tested. The GMM-ITVQ algorithm was found to be the most efficient alternative for the GMM-EM. It gives correct classification rates at a similar level to that of GMM-EM. Finally, representative performance benchmarks and system behaviour experiments on NIST SRE corpora are presented.