A regularized kernel-based approach to unsupervised audio segmentation

Authors:
Zaid Harchaoui;Felicien Vallet;Alexandre Lung-Yut-Fong;Olivier Cappe
Affiliations:
LTCI, TELECOM ParisTech&CNRS, 46 rue Barrault, 75634 cedex 13, France;LTCI, TELECOM ParisTech&CNRS, 46 rue Barrault, 75634 cedex 13, France;LTCI, TELECOM ParisTech&CNRS, 46 rue Barrault, 75634 cedex 13, France;LTCI, TELECOM ParisTech&CNRS, 46 rue Barrault, 75634 cedex 13, France
Venue:
ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
Year:
2009

Citing 0
Cited 1

Segmentation of the mean of heteroscedastic data via cross-validation

Statistics and Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

We introduce a regularized kernel-based rule for unsupervised change detection based on a simpler version of the recently proposed kernel Fisher discriminant ratio. Compared to other kernel-based change detectors found in the literature, the proposed test statistic is easier to compute and has a known asymptotic distribution which can effectively be used to set the false alarm rate a priori. This technique is applied for segmenting tracks from TV shows, both for segmentation into semantically homogeneous sections (applause, movie, music, etc.) and for speaker diarization within the speech sections. On these tasks, the proposed approach outperforms other kernel-based tests and is competitive with a standard HMM-based supervised alternative.