Time difference of arrival estimation of speech source in a noisy and reverberant environment

  • Authors:
  • Tsvi G. Dvorkind;Sharon Gannot

  • Affiliations:
  • Faculty of Electrical Engineering, Technion, Technion City, 32000 Haifa, Israel;School of Electrical Engineering, Bar-Ilan University, 52900 Ramat-Gan, Israel

  • Venue:
  • Signal Processing - Content-based image and video retrieval
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Determining the spatial position of a speaker finds a growing interest in video conference scenarios where automated camera steering and tracking are required. Speaker localization can be achieved with a dual-step approach. In the preliminary stage a microphone array is used to extract the time difference of arrival (TDOA) of the speech signal. These readings are then used by the second stage for the actual localization. In this work we present novel, frequency domain, approaches for TDOA calculation in a reverberant and noisy environment. Our methods are based on the speech quasistationarity property, noise stationarity and on the fact that the speech and the noise are uncorrelated. The mathematical derivations in this work are followed by an extensive experimental study which involves static and tracking scenarios.