Locality preserving speaker clustering

  • Authors:
  • Stephen M. Chu;Hao Tang;Thomas S. Huang

  • Affiliations:
  • IBM T. J. Watson Research Center, Yorktown Heights, N.Y.;Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, I.L.;Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, I.L.

  • Venue:
  • ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose an efficient speaker clustering approach based on a locality preserving linear projective mapping in the Gaussian mixture model (GMM) mean supervector space. While the GMM mean supervector has turned out to be an effective representation of speakers, its dimensionality is usually very high. The locality preserving projection (LPP) maps the high-dimensional GMM mean supervector space into a lower-dimensional subspace in an unsupervised fashion where the local neighborhood structure of the data points is optimally preserved. Our speaker clustering experiments clearly show that in the reduced-dimensional LPP subspace, traditional clustering techniques such as k-means and hierarchical clustering perform significantly better than they would in the original high-dimensional GMM mean supervector space and in its principal component subspace.