"Bag of codes" based automatic speaker identification

Authors:
Ming-Liang Gu;Zhe Chen;Jin-Song Wang;Jing-Lan Feng
Affiliations:
School of Physics & Electronic Engineering, Xuzhou Normal University, Xuzhou, Jiangsu, China and School of Linguistic Sciences, Xuzhou Normal University, Xuzhou, Jiangsu, China;School of Linguistic Sciences, Xuzhou Normal University, Xuzhou, Jiangsu, China;School of Information & Communication, Xuzhou Normal University, Xuzhou, Jiangsu, China;School of Linguistic Sciences, Xuzhou Normal University, Xuzhou, Jiangsu, China
Venue:
ASID'09 Proceedings of the 3rd international conference on Anti-Counterfeiting, security, and identification in communication
Year:
2009

Citing 4
Cited 0

Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval

ECML '98 Proceedings of the 10th European Conference on Machine Learning
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)
A Bayesian Hierarchical Model for Learning Natural Scene Categories

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Real-time speaker identification and verification

IEEE Transactions on Audio, Speech, and Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

The response time required in speaker identification systems mainly depend on the amount of enrolled speakers. Thus, how to reduce the computational cost, when evaluating large speaker databases, is the key problem. Thus, a "bag of codes" algorithm is proposed, which can generate speaker models by estimating the probability distribution of each code in speech data. Experiments prove that the new configuration has substantially lower complexity than commonly used methods with comparable identification accuracy, overcoming one bottleneck in the development of speaker identification research.