Enhanced VQ-based algorithms for speech independent speaker identification

Authors:
Ningping Fan;Justinian Rosca
Affiliations:
Siemens Corporate Research Inc., Princeton, New Jersey;Siemens Corporate Research Inc., Princeton, New Jersey
Venue:
AVBPA'03 Proceedings of the 4th international conference on Audio- and video-based biometric person authentication
Year:
2003

Citing 2
Cited 1

Fundamentals of speech recognition

Fundamentals of speech recognition
Speaker Discriminative Weighting Method for VQ-Based Speaker Identification

AVBPA '01 Proceedings of the Third International Conference on Audio- and Video-Based Biometric Person Authentication

The usage of independent component analysis for robust speaker verification

AIA'06 Proceedings of the 24th IASTED international conference on Artificial intelligence and applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Weighted distance measure and discriminative training are two different directions to enhance VQ-based solutions for speaker identification. In the first direction, the partition normalized distance measure successfully used normalized feature components to account for varying importance of the LPC coefficients. In the second direction, the group vector quantization speeded up discriminative training by randomly selecting a group of vectors as a training unit in each learning step. This paper introduces an alternative, called heuristic weighted distance, to linearly lift up higher order MFCC feature vector components. Then two new algorithms are proposed to combine the heuristic weighted distance and the partition normalized distance measure with the group vector quantization to take full advantage of both directions. Testing on the TIMIT and NTIMIT corpora showed that the proposed methods are superior to current VQ-based solutions, and are in a comparable range to the Gaussian Mixture Model using the Wavelet or MFCC features.