Speech Communication - Special issue on speech processing in adverse conditions
Adaptive filter theory (3rd ed.)
Adaptive filter theory (3rd ed.)
A Robust Method for Speech Signal Time-Delay Estimation in Reverberant Rooms
ICASSP '97 Proceedings of the 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '97) -Volume 1 - Volume 1
Multi-source localization in reverberant environments by ROOT-MUSIC and clustering
ICASSP '00 Proceedings of the Acoustics, Speech, and Signal Processing, 2000. on IEEE International Conference - Volume 02
Noise cancellation with static mixtures of a nonstationary signal and stationary noise
EURASIP Journal on Applied Signal Processing
EURASIP Journal on Applied Signal Processing
IEEE Transactions on Signal Processing
A frequency domain blind signal separation method based ondecorrelation
IEEE Transactions on Signal Processing
On the application of the global matched filter to DOA estimation with uniform circular arrays
IEEE Transactions on Signal Processing
System identification using nonstationary signals
IEEE Transactions on Signal Processing
Signal enhancement using beamforming and nonstationarity withapplications to speech
IEEE Transactions on Signal Processing
Time delay estimation in room acoustic environments: an overview
EURASIP Journal on Applied Signal Processing
Microphone array speaker localizers using spatial-temporal information
EURASIP Journal on Applied Signal Processing
TDOA-based adaptive sensing in multi-agent cooperative target tracking
Signal Processing
Speaker Tracking Using Recursive EM Algorithms
IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)
Hi-index | 0.00 |
Determining the spatial position of a speaker finds a growing interest in video conference scenarios where automated camera steering and tracking are required. Speaker localization can be achieved with a dual-step approach. In the preliminary stage a microphone array is used to extract the time difference of arrival (TDOA) of the speech signal. These readings are then used by the second stage for the actual localization. In this work we present novel, frequency domain, approaches for TDOA calculation in a reverberant and noisy environment. Our methods are based on the speech quasistationarity property, noise stationarity and on the fact that the speech and the noise are uncorrelated. The mathematical derivations in this work are followed by an extensive experimental study which involves static and tracking scenarios.