Multiple-object tracking in cluttered and crowded public spaces

Authors:
Rhys Martin;Ognjen Arandjelović
Affiliations:
University of Cambridge, Department of Engineering, Cambridge, UK;University of Cambridge, Department of Engineering, Cambridge, UK
Venue:
ISVC'10 Proceedings of the 6th international conference on Advances in visual computing - Volume Part III
Year:
2010

Citing 9
Cited 0

Pfinder: Real-Time Tracking of the Human Body

IEEE Transactions on Pattern Analysis and Machine Intelligence
W4: Real-Time Surveillance of People and Their Activities

IEEE Transactions on Pattern Analysis and Machine Intelligence
Pedestrian Detection from a Moving Vehicle

ECCV '00 Proceedings of the 6th European Conference on Computer Vision-Part II
Moving Target Classification and Tracking from Real-time Video

WACV '98 Proceedings of the 4th IEEE Workshop on Applications of Computer Vision (WACV'98)
Detecting Pedestrians Using Patterns of Motion and Appearance

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
The Template Update Problem

IEEE Transactions on Pattern Analysis and Machine Intelligence
Counting Crowded Moving Objects

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
Unsupervised Bayesian Detection of Independent Motion in Crowds

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
Tracking multiple humans in crowded environment

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper addresses the problem of tracking moving objects of variable appearance in challenging scenes rich with features and texture. Reliable tracking is of pivotal importance in surveillance applications. It is made particularly difficult by the nature of objects encountered in such scenes: these too change in appearance and scale, and are often articulated (e.g. humans). We propose a method which uses fast motion detection and segmentation as a constraint for both building appearance models and their robust propagation (matching) in time. The appearance model is based on sets of local appearances automatically clustered using spatio-kinetic similarity, and is updated with each new appearance seen. This integration of all seen appearances of a tracked object makes it extremely resilient to errors caused by occlusion and the lack of permanence of due to low data quality, appearance change or background clutter. These theoretical strengths of our algorithm are empirically demonstrated on two hour long video footage of a busy city marketplace.