Speaker identification using row mean vector of spectrogram

Authors:
H. B. Kekre;A. Athawale;M. Desai
Affiliations:
MPSTME, SVKM's NMIMS University, Mumbai, India;Thadomal Shahani Engg. College, Bandra (W), Mumbai, India;K. J. Somaiya Institute of Engg. and Information Technology, Sion (W), Mumbai, India
Venue:
Proceedings of the International Conference & Workshop on Emerging Trends in Technology
Year:
2011

Citing 2
Cited 0

A Speaker Identification System Using MFCC Features with VQ Technique

IITA '09 Proceedings of the 2009 Third International Symposium on Intelligent Information Technology Application - Volume 03
Image retrieval using DCT on row mean, column mean and both with image fragmentation

Proceedings of the International Conference and Workshop on Emerging Trends in Technology

Quantified Score

Hi-index	0.01

Visualization

Abstract

In this paper a simple approach to text dependent speaker identification using spectrograms and row mean is presented. This, mainly, revolves around trapping the complex patterns of variation in frequency and amplitude with time while an individual utters a given word through equalized spectrogram. These equalized spectrograms are used as a database to successfully identify the unknown individual from his/her voice. The features used for identifying, rely on optimal spectrogram segmentation and the Euclidean distance of the distributional features of the spectrograms of the unknown voice with that of a given known speaker in the database. Performance of this novel approach on a sample collected as two separate databases from 12 speakers and 28 speakers show that this methodology can be effectively used to produce a desirable success rate.