Time difference of arrival estimation of speech source in a noisy and reverberant environment

Authors:
Tsvi G. Dvorkind;Sharon Gannot
Affiliations:
Faculty of Electrical Engineering, Technion, Technion City, 32000 Haifa, Israel;School of Electrical Engineering, Bar-Ilan University, 52900 Ramat-Gan, Israel
Venue:
Signal Processing - Content-based image and video retrieval
Year:
2005

Citing 11
Cited 5

Assessment for automatic speech recognition II: NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition systems

Speech Communication - Special issue on speech processing in adverse conditions
Adaptive filter theory (3rd ed.)

Adaptive filter theory (3rd ed.)
A Robust Method for Speech Signal Time-Delay Estimation in Reverberant Rooms

ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97) -Volume 1 - Volume 1
Multi-source localization in reverberant environments by ROOT-MUSIC and clustering

ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 02
Noise cancellation with static mixtures of a nonstationary signal and stationary noise

EURASIP Journal on Applied Signal Processing
Robust adaptive time delay estimation for speaker localization in noisy and reverberant acoustic environments

EURASIP Journal on Applied Signal Processing
Maximum-likelihood source localization and unknown sensor locationestimation for wideband signals in the near-field

IEEE Transactions on Signal Processing
A frequency domain blind signal separation method based ondecorrelation

IEEE Transactions on Signal Processing
On the application of the global matched filter to DOA estimation with uniform circular arrays

IEEE Transactions on Signal Processing
System identification using nonstationary signals

IEEE Transactions on Signal Processing
Signal enhancement using beamforming and nonstationarity withapplications to speech

IEEE Transactions on Signal Processing

Time delay estimation in room acoustic environments: an overview

EURASIP Journal on Applied Signal Processing
Microphone array speaker localizers using spatial-temporal information

EURASIP Journal on Applied Signal Processing
Particle filtering for TDOA based acoustic source tracking: Nonconcurrent Multiple Talkers

Signal Processing
TDOA-based adaptive sensing in multi-agent cooperative target tracking

Signal Processing
Speaker Tracking Using Recursive EM Algorithms

IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Determining the spatial position of a speaker finds a growing interest in video conference scenarios where automated camera steering and tracking are required. Speaker localization can be achieved with a dual-step approach. In the preliminary stage a microphone array is used to extract the time difference of arrival (TDOA) of the speech signal. These readings are then used by the second stage for the actual localization. In this work we present novel, frequency domain, approaches for TDOA calculation in a reverberant and noisy environment. Our methods are based on the speech quasistationarity property, noise stationarity and on the fact that the speech and the noise are uncorrelated. The mathematical derivations in this work are followed by an extensive experimental study which involves static and tracking scenarios.