A Graphical Model for Audiovisual Object Tracking
IEEE Transactions on Pattern Analysis and Machine Intelligence
Towards reliable multimodal sensing in aware environments
Proceedings of the 2001 workshop on Perceptive user interfaces
Audio-video array source separation for perceptual user interfaces
Proceedings of the 2001 workshop on Perceptive user interfaces
New direct approaches to robust sound source localization
ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 2
Robotics and Autonomous Systems
Joint audio-visual tracking using particle filters
EURASIP Journal on Applied Signal Processing
IEEE Transactions on Signal Processing
Hi-index | 0.00 |
Accurate and fast localization of multiple speech sound sources is a significant problem in videoconferencing systems. Based on the observation that the wavelengths of the sound from a speech source are comparable to the dimensions of the space being searched, and that the source is broadband, we develop an efficient search strategy that finds the source(s) in a given space. The search is made efficient by using coarse-to-fine strategies in both space and frequency. The algorithm is shown to be robust compared to typical delay-based estimators and fast enough for real-time implementation. Its performance can be further improved by using constraints from computer vision.