Time difference of arrival estimation of speech source in a noisy and reverberant environment
Signal Processing - Content-based image and video retrieval
Multiple source localization based on acoustic map de-emphasis
EURASIP Journal on Audio, Speech, and Music Processing
A high accuracy, low-latency, scalable microphone-array system for conversation analysis
Proceedings of the 2012 ACM Conference on Ubiquitous Computing
Localization of acoustic sources in reverberant environments by microphone arrays remains a challenging task in audio signal processing. Most assumptions of commonly adopted models are not met in real applications. Moreover, in practical systems it is often inconvenient or impossible to employ sophisticated and costly architectures that require precise synchronization and fast data shuffling among sensors. In this paper, a new robust multi-step procedure for speaker localization in reverberant rooms is introduced and described. The approach is based on a disturbed harmonics model of time delays in the frequency domain and employs the well-known ROOT-MUSIC algorithm after a preliminary distributed processing of the received signals. Candidate source positions are then estimated by clustering the raw time difference of arrival (TDOA) estimates. The main features of the proposed approach, compared to previous solutions, are the capability of tracking multiple speakers and the high accuracy of the closed-form TDOA estimator.
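To make the idea of treating a time delay as a harmonic in the frequency domain concrete, the following is a minimal sketch and not the authors' procedure, whose details the abstract does not give. It assumes that the phase-normalized cross-spectrum of two microphone signals behaves, across consecutive DFT bins, like a noisy complex exponential whose angular frequency is proportional to the delay, and it applies a textbook ROOT-MUSIC estimator to that sequence. The function names `root_music` and `tdoa_from_spectra`, the window length `m`, and the synthetic test data are all hypothetical choices made for illustration.

```python
import numpy as np


def root_music(x, n_sources, m):
    """Angular frequencies (rad/sample) of n_sources complex exponentials in x."""
    x = np.asarray(x, dtype=complex)
    # Sliding windows of length m act as snapshots (one per row).
    snaps = np.array([x[i:i + m] for i in range(len(x) - m + 1)])
    # Sample covariance R = mean(x_i x_i^H), shape (m, m).
    cov = snaps.T @ snaps.conj() / snaps.shape[0]
    # eigh returns eigenvalues in ascending order, so the first
    # m - n_sources eigenvectors span the noise subspace.
    _, vecs = np.linalg.eigh(cov)
    noise = vecs[:, : m - n_sources]
    proj = noise @ noise.conj().T
    # a(z)^H proj a(z) is a polynomial in z whose coefficients are the
    # diagonal sums of proj, listed here from highest to lowest power.
    coeffs = [np.trace(proj, offset=k) for k in range(m - 1, -m, -1)]
    roots = np.roots(coeffs)
    # Roots come in (z, 1/conj(z)) pairs; keep those inside the unit
    # circle and pick the ones closest to it.
    roots = roots[np.abs(roots) < 1.0]
    roots = roots[np.argsort(1.0 - np.abs(roots))[:n_sources]]
    return np.angle(roots)


def tdoa_from_spectra(spec_a, spec_b, n_fft, n_sources=1, m=16):
    """Delay of channel b relative to channel a, in samples.

    spec_a and spec_b hold the same run of consecutive DFT bins
    (hypothetical interface; n_fft is the DFT length).
    """
    cross = spec_a * np.conj(spec_b)
    # Keep only the phase (PHAT-style normalization, an assumption here).
    phase_only = cross / np.maximum(np.abs(cross), 1e-12)
    omega = root_music(phase_only, n_sources, m)
    # A delay of d samples advances the phase by 2*pi*d/n_fft per bin.
    return omega * n_fft / (2.0 * np.pi)


# Synthetic check: one source, fractional delay, mild additive noise.
rng = np.random.default_rng(0)
n_fft = 512
bins = np.arange(1, 200)
true_delay = 5.3
spec_a = np.exp(2j * np.pi * rng.random(bins.size))
spec_b = spec_a * np.exp(-2j * np.pi * bins * true_delay / n_fft)
spec_b = spec_b + 0.05 * (rng.standard_normal(bins.size)
                          + 1j * rng.standard_normal(bins.size))
est = tdoa_from_spectra(spec_a, spec_b, n_fft)
print("estimated delay (samples):", est)
```

Because the estimate comes from polynomial rooting rather than a grid search over candidate delays, it is not limited to integer sample lags, which is one way to read the abstract's reference to a closed-form TDOA estimator. Raw estimates from several microphone pairs and frames could then be clustered to obtain candidate source positions, a step this sketch does not cover.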