Speaker identification using row mean vector of spectrogram

  • Authors:
  • H. B. Kekre;A. Athawale;M. Desai

  • Affiliations:
  • MPSTME, SVKM's NMIMS University, Mumbai, India;Thadomal Shahani Engg. College, Bandra (W), Mumbai, India;K. J. Somaiya Institute of Engg. and Information Technology, Sion (W), Mumbai, India

  • Venue:
  • Proceedings of the International Conference & Workshop on Emerging Trends in Technology
  • Year:
  • 2011

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper a simple approach to text dependent speaker identification using spectrograms and row mean is presented. This, mainly, revolves around trapping the complex patterns of variation in frequency and amplitude with time while an individual utters a given word through equalized spectrogram. These equalized spectrograms are used as a database to successfully identify the unknown individual from his/her voice. The features used for identifying, rely on optimal spectrogram segmentation and the Euclidean distance of the distributional features of the spectrograms of the unknown voice with that of a given known speaker in the database. Performance of this novel approach on a sample collected as two separate databases from 12 speakers and 28 speakers show that this methodology can be effectively used to produce a desirable success rate.