Real-time speaker identification and verification

Authors:
T. Kinnunen;E. Karpov;P. Franti
Affiliations:
Dept. of Comput. Sci., Univ. of Joensuu, Finland;-;-
Venue:
IEEE Transactions on Audio, Speech, and Language Processing
Year:
2006

Citing 0
Cited 16

Accuracy of MFCC-based speaker recognition in series 60 device

EURASIP Journal on Applied Signal Processing
Real-time speaker identification system

ACS'07 Proceedings of the 7th Conference on 7th WSEAS International Conference on Applied Computer Science - Volume 7
Speaker Segmentation for Air Traffic Control

Speaker Classification II
Efficient likelihood evaluation and dynamic Gaussian selection for HMM-based speech recognition

Computer Speech and Language
An overview of text-independent speaker recognition: From features to supervectors

Speech Communication
Particle swarm optimization for sorted adapted Gaussian mixture models

IEEE Transactions on Audio, Speech, and Language Processing
Speaker recognition using speaker-independent universal acoustic model and synchronous sensing for "business microscope"

ISWPC'09 Proceedings of the 4th international conference on Wireless pervasive computing
"Bag of codes" based automatic speaker identification

ASID'09 Proceedings of the 3rd international conference on Anti-Counterfeiting, security, and identification in communication
Spectral entropy and spectral shape based pre-quantization for real time speaker identification system

International Journal of Speech Technology
Comparison of the impact of some Minkowski metrics on VQ/GMM based speaker recognition

Computers and Electrical Engineering
Comparison of clustering methods: A case study of text-independent speaker modeling

Pattern Recognition Letters
A multi-resolution multi-classifier system for speaker verification

Expert Systems: The Journal of Knowledge Engineering
A speaker recognition based approach for identifying voice spammer

WISM'12 Proceedings of the 2012 international conference on Web Information Systems and Mining
Real-Time Speaker Verification System Implemented on Reconfigurable Hardware

Journal of Signal Processing Systems
The sound of silence

Proceedings of the 11th ACM Conference on Embedded Networked Sensor Systems
Local business ambience characterization through mobile audio sensing

Proceedings of the 23rd international conference on World wide web

Quantified Score

Hi-index	0.00

Visualization

Abstract

In speaker identification, most of the computation originates from the distance or likelihood computations between the feature vectors of the unknown speaker and the models in the database. The identification time depends on the number of feature vectors, their dimensionality, the complexity of the speaker models and the number of speakers. In this paper, we concentrate on optimizing vector quantization (VQ) based speaker identification. We reduce the number of test vectors by pre-quantizing the test sequence prior to matching, and the number of speakers by pruning out unlikely speakers during the identification process. The best variants are then generalized to Gaussian mixture model (GMM) based modeling. We apply the algorithms also to efficient cohort set search for score normalization in speaker verification. We obtain a speed-up factor of 16:1 in the case of VQ-based modeling with minor degradation in the identification accuracy, and 34:1 in the case of GMM-based modeling. An equal error rate of 7% can be reached in 0.84 s on average when the length of test utterance is 30.4 s.