Audiovisual diarization of people in video content

  • Authors:
  • Elie El Khoury;Christine Sénac;Philippe Joly

  • Affiliations:
  • Idiap Research Institute, Martigny, Switzerland and Laboratoire d'Informatique de l'Université du Maine, Le Mans, France;Institut de Recherche en Informatique de Toulouse, Toulouse, France;Institut de Recherche en Informatique de Toulouse, Toulouse, France

  • Venue:
  • Multimedia Tools and Applications
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

Audio-Visual People Diarization (AVPD) is an original framework that simultaneously improves audio, video, and audiovisual diarization results. Following a literature review of people diarization for both audio and video content and their limitations, which includes our own contributions, we describe a proposed method for associating both audio and video information by using co-occurrence matrices and present experiments which were conducted on a corpus containing TV news, TV debates, and movies. Results show the effectiveness of the overall diarization system and confirm the gains audio information can bring to video indexing and vice versa.