Active speech source localization by a dual coarse-to-fine search

  • Authors:
  • R. Duraiswami;D. Zotkin;L. S. Davis

  • Affiliations:
  • Inst. for Adv. Comput. Studies, Maryland Univ., College Park, MD, USA;-;-

  • Venue:
  • ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 05
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Accurate and fast localization of multiple speech sound sources is a significant problem in videoconferencing systems. Based on the observation that the wavelengths of the sound from a speech source are comparable to the dimensions of the space being searched, and that the source is broadband, we develop an efficient search strategy that finds the source(s) in a given space. The search is made efficient by using coarse-to-fine strategies in both space and frequency. The algorithm is shown to be robust compared to typical delay-based estimators and fast enough for real-time implementation. Its performance can be further improved by using constraints from computer vision.