An overview of text-independent speaker recognition: From features to supervectors

  • Authors:
  • Tomi Kinnunen;Haizhou Li

  • Affiliations:
  • Department of Computer Science and Statistics, Speech and Image Processing Unit, University of Joensuu, P.O. Box 111, 80101 Joensuu, Finland;Department of Human Language Technology, Institute for Infocomm Research (I2R), 1 Fusionopolis Way, #21-01 Connexis, South Tower, Singapore 138632, Singapore

  • Venue:
  • Speech Communication
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper gives an overview of automatic speaker recognition technology, with an emphasis on text-independent recognition. Speaker recognition has been studied actively for several decades. We give an overview of both the classical and the state-of-the-art methods. We start with the fundamentals of automatic speaker recognition, concerning feature extraction and speaker modeling. We elaborate advanced computational techniques to address robustness and session variability. The recent progress from vectors towards supervectors opens up a new area of exploration and represents a technology trend. We also provide an overview of this recent development and discuss the evaluation methodology of speaker recognition systems. We conclude the paper with discussion on future directions.