Tracking intermittently speaking multiple speakers using a particle filter

  • Authors:
  • Angela Quinlan;Mitsuru Kawamoto;Yosuke Matsusaka;Hideki Asoh;Futoshi Asano

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • EURASIP Journal on Audio, Speech, and Music Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The problem of tracking multiple intermittently speaking speakers is difficult as some distinct problems must be addressed. The number of active speakers must be estimated, these active speakers must be identified, and the locations of all speakers including inactive speakers must be tracked. In this paper we propose a method for tracking intermittently speaking multiple speakers using a particle filter. In the proposed algorithm the number of active speakers is firstly estimated based on the Exponential Fitting Test (EFT), a source number estimation technique which we have proposed. The locations of the speakers are then tracked using a particle filtering framework within which the decomposed likelihood is used in order to decouple the observed audio signal and associate each element of the decomposed signal with an active speaker. The tracking accuracy is then further improved by the inclusion of a silence region detection step and estimation of the noise-only covariance matrix. The method was evaluated using live recordings of 3 speakers and the results show that the method produces highly accurate tracking results.