Dynamic Time Warping Based Approach to Text-Dependent Speaker Identification Using Spectrograms

Authors:
Tridibesh Dutta
Affiliations:
-
Venue:
CISP '08 Proceedings of the 2008 Congress on Image and Signal Processing, Vol. 2 - Volume 02
Year:
2008

Citing 0
Cited 2

Learning People Trajectories Using Semi-directional Statistics

AVSS '09 Proceedings of the 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance
A multi-resolution multi-classifier system for speaker verification

Expert Systems: The Journal of Knowledge Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

The goal of this paper is to study a new approach to text dependent speaker identification using the complex patterns of variation in frequency and amplitude with time while an individual utters a given word through spectrogram segmentation and template matching. The optimally segmented spectrograms are used as a database to successfully identify the unknown individual from his/her voice. The methodology used for identifying, rely on classification of spectrograms (of speech signals), based on dynamic time warping (DTW) matching of conditionally quantized frequency-time domain features of the database samples and the unknown speech sample. Experimental results on a sample collected from 40 speakers show that this methodology can be effectively used to produce a desirable success rate.