Inclusion of Video Information for Detection of Acoustic Events Using the Fuzzy Integral

  • Authors:
  • Taras Butko;Andrey Temko;Climent Nadeu;Cristian Canton

  • Affiliations:
  • Department of Signal Theory and Communications, , and TALP Research Center,;Department of Signal Theory and Communications, , and TALP Research Center,;Department of Signal Theory and Communications, , and TALP Research Center,;Department of Signal Theory and Communications, ,

  • Venue:
  • MLMI '08 Proceedings of the 5th international workshop on Machine Learning for Multimodal Interaction
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

When applied to interactive seminars, the detection of acoustic events from only audio information shows a large amount of errors, which are mostly due to the temporal overlaps of sounds. Video signals may be a useful additional source of information to cope with that problem for particular events. In this work, we aim at improving the detection of steps by using two audio-based Acoustic Event Detection (AED) systems, with SVM and HMM, and a video-based AED system, which employs the output of a 3D video tracking algorithm. The fuzzy integral is used to fuse the outputs of the three detection systems. Experimental results using the CLEAR 2007 evaluation data show that video information can be successfully used to improve the results of audio-based AED.