"Bag of codes" based automatic speaker identification

  • Authors:
  • Ming-Liang Gu;Zhe Chen;Jin-Song Wang;Jing-Lan Feng

  • Affiliations:
  • School of Physics & Electronic Engineering, Xuzhou Normal University, Xuzhou, Jiangsu, China and School of Linguistic Sciences, Xuzhou Normal University, Xuzhou, Jiangsu, China;School of Linguistic Sciences, Xuzhou Normal University, Xuzhou, Jiangsu, China;School of Information & Communication, Xuzhou Normal University, Xuzhou, Jiangsu, China;School of Linguistic Sciences, Xuzhou Normal University, Xuzhou, Jiangsu, China

  • Venue:
  • ASID'09 Proceedings of the 3rd international conference on Anti-Counterfeiting, security, and identification in communication
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The response time required in speaker identification systems mainly depend on the amount of enrolled speakers. Thus, how to reduce the computational cost, when evaluating large speaker databases, is the key problem. Thus, a "bag of codes" algorithm is proposed, which can generate speaker models by estimating the probability distribution of each code in speech data. Experiments prove that the new configuration has substantially lower complexity than commonly used methods with comparable identification accuracy, overcoming one bottleneck in the development of speaker identification research.