A regularized kernel-based approach to unsupervised audio segmentation

  • Authors:
  • Zaid Harchaoui;Felicien Vallet;Alexandre Lung-Yut-Fong;Olivier Cappe

  • Affiliations:
  • LTCI, TELECOM ParisTech&CNRS, 46 rue Barrault, 75634 cedex 13, France;LTCI, TELECOM ParisTech&CNRS, 46 rue Barrault, 75634 cedex 13, France;LTCI, TELECOM ParisTech&CNRS, 46 rue Barrault, 75634 cedex 13, France;LTCI, TELECOM ParisTech&CNRS, 46 rue Barrault, 75634 cedex 13, France

  • Venue:
  • ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

We introduce a regularized kernel-based rule for unsupervised change detection based on a simpler version of the recently proposed kernel Fisher discriminant ratio. Compared to other kernel-based change detectors found in the literature, the proposed test statistic is easier to compute and has a known asymptotic distribution which can effectively be used to set the false alarm rate a priori. This technique is applied for segmenting tracks from TV shows, both for segmentation into semantically homogeneous sections (applause, movie, music, etc.) and for speaker diarization within the speech sections. On these tasks, the proposed approach outperforms other kernel-based tests and is competitive with a standard HMM-based supervised alternative.