Dominance detection in a reverberated acoustic scenario

  • Authors:
  • Emanuele Principi;Rudy Rotili;Martin Wöllmer;Stefano Squartini;Björn Schuller

  • Affiliations:
  • Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Ancona, Italy;Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Ancona, Italy;Institute for Human-Machine Communication, Technische Universität München, Germany;Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Ancona, Italy;Institute for Human-Machine Communication, Technische Universität München, Germany

  • Venue:
  • ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part I
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

This work proposes a dominance detection framework operating in reverberated environments. The framework is composed of a speech enhancement front-end, which automatically reduces the distortions introduced by room reverberation in the speech signals, and a dominance detector, which processes the enhanced signals and estimates the most and least dominant person in a segment. The front-end is composed by three cooperating blocks: speaker diarization, room impulse responses identification and speech dereverberation. The dominance estimation algorithm is based on bidirectional Long Short-Term Memory networks which allow for context-sensitive activity classification from audio feature functionals extracted via the real-time speech feature extraction toolkit openSMILE. Experiments have been performed suitably reverberating the DOME dataset: the absolute accuracy improvement averaged over the addressed reverberated conditions is 32.68% in the most dominant person estimation task and 36.56% in the least dominant person estimation one, both with full agreement among annotators.