On the use of dot scoring for speaker diarization

  • Authors:
  • Mireia Diez;Mikel Penagarikano;Amparo Varona;Luis Javier Rodriguez-Fuentes;German Bordel

  • Affiliations:
  • GTTS, Department of Electricity and Electronics, University of the Basque Country, Spain;GTTS, Department of Electricity and Electronics, University of the Basque Country, Spain;GTTS, Department of Electricity and Electronics, University of the Basque Country, Spain.;GTTS, Department of Electricity and Electronics, University of the Basque Country, Spain;GTTS, Department of Electricity and Electronics, University of the Basque Country, Spain

  • Venue:
  • IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, an alternative dot scoring based agglomerative hierarchical clustering approach for speaker diarization is presented. Dot-scoring is a simple and fast technique used in speaker verification that makes use of a linearized procedure to score test segments against target models. In our speaker diarization approach speech segments are represented by MAP-adapted GMM zero and first order statistics, dot scoring is applied to compute a similarity measure between segments (or clusters) and finally an agglomerative clustering algorithm is applied until no pair of clusters exceeds a similarity threshold. This diarization system was developed for the Albayzin 2010 Speaker Diarization Evaluation on broadcast news. Results show that the lowest error rate that the clustering algorithm could attain for the evaluation set was around 20% and that over-segmentation was the main source of degradation, due to the lack of robustness in the estimation of statistics for short segments.