Speaker diarization using unsupervised discriminant analysis of inter-channel delay features

  • Authors:
  • Nicholas W. D. Evans;Corinne Fredouille;Jean-Francois Bonastre

  • Affiliations:
  • EURECOM, Sophia Antipolis, France;Laboratoire Informatique d'Avignon (LIA), University of Avignon, France;Laboratoire Informatique d'Avignon (LIA), University of Avignon, France

  • Venue:
  • ICASSP '09 Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
  • Year:
  • 2009

Abstract

When multiple microphones are available, estimates of inter-channel delay, which characterise a speaker's location, can be used as features for speaker diarization. Background noise and reverberation can, however, lead to noisy features and poor performance. To ameliorate these problems, this paper presents a new approach to the discriminant analysis of delay features for speaker diarization. This novel approach, though based on discriminant analysis, is unsupervised and aims to increase speaker separability in delay-space. We assess the approach on subsets of four standard NIST RT datasets and demonstrate a relative improvement in diarization error rate of 25% on a separate evaluation set using delay features alone.
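
To make the notion of an inter-channel delay feature concrete, the following is a minimal sketch of estimating the delay between two microphone channels. It uses plain time-domain cross-correlation on a toy signal, which is an assumption made for illustration; the abstract does not state which delay estimator the paper uses. The function name `estimate_delay`, the parameter `max_lag`, and the sample values are all hypothetical.

```python
# Illustrative sketch only: plain cross-correlation delay estimation,
# not necessarily the estimator used in the paper.

def estimate_delay(ref, other, max_lag):
    """Return the lag (in samples) at which `other` best aligns with `ref`.

    A positive result means `other` is a delayed copy of `ref`.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        # Correlate ref[n] with other[n + lag] over the overlapping samples.
        score = 0.0
        for n, r in enumerate(ref):
            m = n + lag
            if 0 <= m < len(other):
                score += r * other[m]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


# Toy data (hypothetical): a short pulse, and the same pulse 3 samples later.
ref = [0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
delayed = [0.0, 0.0, 0.0] + ref[:-3]
delay = estimate_delay(ref, delayed, max_lag=5)
```

With more than two microphones, one such delay per channel pair (relative to a reference channel) would form a feature vector for each analysis frame; in noisy or reverberant conditions these estimates become unreliable, which is the problem the abstract describes.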