Person Tracking with Audio-Visual Cues Using the Iterative Decoding Framework

  • Authors:
  • Shankar T. Shivappa, Mohan M. Trivedi, Bhaskar D. Rao

  • Venue:
  • AVSS '08 Proceedings of the 2008 IEEE Fifth International Conference on Advanced Video and Signal Based Surveillance
  • Year:
  • 2008

Abstract

Tracking humans in an indoor environment is an essential part of surveillance systems. Vision-based and microphone-array-based trackers have been extensively researched in the past, and audio-visual tracking frameworks have also been developed. In this paper, we consider human tracking to be a specific instance of a more general problem: information fusion in multimodal systems. Dynamic Bayesian networks have been the modeling technique of choice for building such information fusion schemes. The complexity and non-Gaussianity of the distributions in the dynamic Bayesian networks for such multimodal systems have led to the use of particle filters as an approximate inference technique. In this paper, we present an alternative approach to the information fusion problem: an iterative decoding algorithm based on the theory of turbo codes and factor graphs used in communication systems. We modify and adapt the iterative decoding algorithm to perform probabilistic inference for the problem of tracking humans in an indoor space, using multiple cameras and microphone arrays.
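The turbo-style fusion the abstract describes can be sketched as two per-modality decoders that repeatedly exchange "extrinsic" beliefs about a shared discrete state. The sketch below is a minimal illustration under simplifying assumptions, not the paper's implementation: the tracked state is discretized into `N` cells, each modality (audio, video) supplies per-frame observation likelihoods for a hidden Markov model over those cells, and each decoding pass folds the other modality's extrinsic message into its likelihoods, with the incoming message divided back out of the posterior to form the outgoing extrinsic message. All function names and parameters are illustrative.

```python
def normalize(v):
    """Scale a nonnegative vector to sum to 1."""
    s = sum(v)
    return [x / s for x in v]

def forward_backward(trans, likes, prior):
    """Posterior state marginals of a discrete HMM.
    trans: N x N transition matrix; likes: T x N observation likelihoods;
    prior: length-N initial state distribution."""
    T, N = len(likes), len(prior)
    # Forward pass (normalized at each step for numerical stability).
    alpha = [normalize([prior[i] * likes[0][i] for i in range(N)])]
    for t in range(1, T):
        a = [sum(alpha[-1][j] * trans[j][i] for j in range(N)) * likes[t][i]
             for i in range(N)]
        alpha.append(normalize(a))
    # Backward pass.
    beta = [[1.0] * N for _ in range(T)]
    for t in range(T - 2, -1, -1):
        b = [sum(trans[i][j] * likes[t + 1][j] * beta[t + 1][j] for j in range(N))
             for i in range(N)]
        beta[t] = normalize(b)
    return [normalize([alpha[t][i] * beta[t][i] for i in range(N)])
            for t in range(T)]

def iterative_fusion(trans, audio_likes, video_likes, prior, n_iters=3):
    """Turbo-style audio-visual fusion (illustrative): each modality's HMM is
    decoded with the other's extrinsic information multiplied into its
    observation likelihoods, for a fixed number of iterations."""
    T, N = len(audio_likes), len(prior)
    extrinsic_av = [[1.0] * N for _ in range(T)]  # audio -> video message
    post = None
    for _ in range(n_iters):
        # Video decoder: use audio's extrinsic info as an extra per-frame term.
        v_likes = [[video_likes[t][i] * extrinsic_av[t][i] for i in range(N)]
                   for t in range(T)]
        post = forward_backward(trans, v_likes, prior)
        # Extrinsic message: posterior with the incoming message divided out,
        # so the other decoder does not double-count its own evidence.
        extrinsic_va = [normalize([post[t][i] / extrinsic_av[t][i]
                                   for i in range(N)]) for t in range(T)]
        # Audio decoder: symmetric step.
        a_likes = [[audio_likes[t][i] * extrinsic_va[t][i] for i in range(N)]
                   for t in range(T)]
        post = forward_backward(trans, a_likes, prior)
        extrinsic_av = [normalize([post[t][i] / extrinsic_va[t][i]
                                   for i in range(N)]) for t in range(T)]
    return post
```

Dividing the incoming message out of the posterior mirrors the extrinsic-information exchange in turbo decoding; in the paper's framework the messages would instead be derived from the factor-graph structure linking cameras and microphone arrays.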