Speaker Recognition Using Pole Distribution of Speech Signals Obtained by Bagging CAN2

  • Authors:
  • Shuichi Kurogi;Seitaro Sato;Kota Ichimaru

  • Affiliations:
  • Kyushu Institute of technology, Fukuoka, Japan 804-8550;Kyushu Institute of technology, Fukuoka, Japan 804-8550;Kyushu Institute of technology, Fukuoka, Japan 804-8550

  • Venue:
  • ICONIP '09 Proceedings of the 16th International Conference on Neural Information Processing: Part I
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

A method for speaker recognition which uses feature vectors of pole distribution derived from the piecewise linear predictive coefficients obtained by the bagging CAN2 (competitive associative net 2) is presented. The CAN2 is a neural net for learning efficient piecewise linear approximation of nonlinear function, and the bagging CAN2 has been shown to have a stable performance in reproduction and recognition of vowel signals. After training the bagging CAN2 with the speech signal of a speaker, the present method obtains a number of poles of piecewise linear predictive coefficients which are expected to reflect the shape and the scale of the speaker's vocal tract. Then, the pole distribution is used as the feature vector for the speaker recognition. The effectiveness is examined and validated with real speech data.