Speaker localization in CHIL lectures: evaluation criteria and results

  • Authors:
  • Maurizio Omologo; Piergiorgio Svaizer; Alessio Brutti; Luca Cristoforetti

  • Affiliations:
  • ITC-irst, Povo, Trento, Italy (all authors)

  • Venue:
  • MLMI'05: Proceedings of the Second International Conference on Machine Learning for Multimodal Interaction
  • Year:
  • 2005

Abstract

This work addresses the problem of automatic speaker localization and tracking in a real lecture scenario. The evaluation criteria recently adopted under CHIL and NIST benchmarking are outlined. Two speaker localization systems are described, both based on Generalized Cross Correlation Phase Transform (GCC-PHAT) analysis and the Global Coherence Field (GCF). Benchmarking results, obtained on a set of 13 lectures, showed an average RMS localization error of about 30 cm.
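
As an illustration of the Generalized Cross Correlation Phase Transform analysis named in the abstract, the following is a minimal sketch of a time-delay estimate between two microphone signals. It is not the authors' implementation: the function name `gcc_phat`, its parameters, and the use of NumPy are assumptions made for this example, and the sketch covers only a single microphone pair, not the full localization systems.

```python
import numpy as np


def gcc_phat(sig, ref, fs, max_tau=None):
    """Estimate the delay of `sig` relative to `ref`, in seconds (hypothetical helper).

    A positive result means `sig` lags behind `ref`.
    """
    # Zero-pad so the circular correlation behaves like a linear one.
    n = len(sig) + len(ref)
    sig_spec = np.fft.rfft(sig, n=n)
    ref_spec = np.fft.rfft(ref, n=n)

    # Cross-power spectrum with PHAT weighting: discard magnitude, keep phase.
    cross = sig_spec * np.conj(ref_spec)
    cross /= np.abs(cross) + 1e-12  # small constant avoids division by zero
    cc = np.fft.irfft(cross, n=n)

    # Restrict the search to physically plausible lags if max_tau is given.
    max_shift = n // 2
    if max_tau is not None:
        max_shift = min(int(fs * max_tau), max_shift)

    # Rotate so that lag 0 sits at index max_shift, then keep +/- max_shift.
    cc = np.roll(cc, max_shift)[: 2 * max_shift + 1]
    shift = int(np.argmax(cc)) - max_shift
    return shift / fs
```

Multiplying the returned delay by the speed of sound (roughly 343 m/s in air at room temperature) gives the difference in path length from the source to the two microphones, which is the quantity a localization system would combine across several microphone pairs.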