Capture interspeaker information with a neural network for speaker identification

Authors:
Lan Wang;Ke Chen;Huisheng Chi
Affiliations:
Dept. of Eng., Cambridge Univ.;-;-
Venue:
IEEE Transactions on Neural Networks
Year:
2002

Citing 0
Cited 3

A model-selection-based self-splitting Gaussian mixture learning with application to speaker identification
EURASIP Journal on Applied Signal Processing

GMM and ANN hybrid model and its application in speaker identification
ICNC'09 Proceedings of the 5th international conference on Natural computation

Efficient MLP constructive training algorithm using a neuron recruiting approach for isolated word recognition system
International Journal of Speech Technology

Abstract

The model-based approach is one of the most widely used methods for speaker identification: a statistical model characterizes a specific speaker's voice, but no interspeaker information is involved in its parameter estimation. Interspeaker information, however, has been observed to be very helpful in discriminating between different speakers. In this paper, we propose a novel method that uses interspeaker information to improve the performance of a model-based speaker identification system. A neural network is employed to capture the interspeaker information from the output space of the statistical models. To make full use of interspeaker information, a rival penalized encoding rule is proposed for designing the supervised learning pairs. Moreover, for better generalization, a query-based learning algorithm is presented that actively selects the input data of interest during training of the neural network. Comparative results on the KING speech corpus show that our method leads to a considerable improvement for a model-based speaker identification system.
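The central idea of the abstract, a neural network trained on the output space of per-speaker statistical models, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the authors' implementation: a single diagonal Gaussian stands in for each speaker model, the network is a one-layer softmax classifier with plain one-hot targets, and the data are synthetic. The rival penalized encoding rule and the query-based learning algorithm are not reproduced, because the abstract does not specify them.

```python
import numpy as np

def fit_gaussian(frames):
    """Diagonal Gaussian as a stand-in (assumption) for a per-speaker model."""
    return frames.mean(axis=0), frames.var(axis=0) + 1e-6

def log_likelihood(frames, model):
    """Per-frame log-likelihood under one speaker's diagonal Gaussian."""
    mean, var = model
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (frames - mean) ** 2 / var, axis=1)

def score_vectors(frames, models):
    """Map each frame to its vector of scores from all speaker models."""
    return np.stack([log_likelihood(frames, m) for m in models], axis=1)

def train_softmax(scores, labels, n_classes, lr=0.1, epochs=300):
    """One-layer softmax network on standardized scores (batch gradient descent)."""
    mu, sigma = scores.mean(axis=0), scores.std(axis=0) + 1e-9
    x = (scores - mu) / sigma
    w = np.zeros((x.shape[1], n_classes))
    b = np.zeros(n_classes)
    onehot = np.eye(n_classes)[labels]  # plain one-hot targets (assumption)
    for _ in range(epochs):
        logits = x @ w + b
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        p = np.exp(logits)
        p /= p.sum(axis=1, keepdims=True)
        grad = (p - onehot) / len(x)
        w -= lr * (x.T @ grad)
        b -= lr * grad.sum(axis=0)
    return w, b, mu, sigma

def predict(scores, params):
    """Pick the speaker with the largest network output."""
    w, b, mu, sigma = params
    return np.argmax(((scores - mu) / sigma) @ w + b, axis=1)

def demo(n_speakers=3, dim=4, n_train=300, n_test=200, seed=0):
    """Synthetic experiment: returns (test accuracy, shape of test score matrix)."""
    rng = np.random.default_rng(seed)
    means = [np.full(dim, 3.0 * k) for k in range(n_speakers)]  # hypothetical speakers
    train = [rng.normal(m, 1.0, size=(n_train, dim)) for m in means]
    test = [rng.normal(m, 1.0, size=(n_test, dim)) for m in means]
    # Each model is estimated from its own speaker's data only.
    models = [fit_gaussian(f) for f in train]
    # The network sees all models' scores together, across all speakers.
    x_train = score_vectors(np.vstack(train), models)
    y_train = np.repeat(np.arange(n_speakers), n_train)
    x_test = score_vectors(np.vstack(test), models)
    y_test = np.repeat(np.arange(n_speakers), n_test)
    params = train_softmax(x_train, y_train, n_speakers)
    accuracy = float(np.mean(predict(x_test, params) == y_test))
    return accuracy, x_test.shape

if __name__ == "__main__":
    print(demo())
```

In this sketch each Gaussian is fitted without reference to the other speakers, while the classifier on top is trained on the joint score vectors, which is where information about the relationship between speakers enters.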