Multimodal people detection and tracking in crowded scenes

Authors:
Luciano Spinello;Rudolph Triebel;Roland Siegwart
Affiliations:
Autonomous Systems Lab, ETH Zurich, Zurich, Switzerland;Autonomous Systems Lab, ETH Zurich, Zurich, Switzerland;Autonomous Systems Lab, ETH Zurich, Zurich, Switzerland
Venue:
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Year:
2008

Abstract

This paper presents a novel people detection and tracking method based on multimodal sensor fusion of 2D laser range data and camera data. The data points in the laser scans are clustered using a novel graph-based method, and an SVM-based version of the cascaded AdaBoost classifier is trained on a set of geometrical features of these clusters. In the detection phase, the classified laser data is projected into the camera image to define a region of interest for the vision-based people detector. This detector is a fast version of the Implicit Shape Model (ISM) that learns an appearance codebook of local SIFT descriptors from a set of hand-labeled images of pedestrians and uses the codebook entries in a voting scheme to locate the centers of detected people. Our extension of the ISM consists of a fast and detailed analysis of the spatial distribution of voters per detected person. Each detected person is tracked using a greedy data association method and multiple Extended Kalman Filters, each with a different motion model, so that the tracker can cope with a variety of motion patterns. The tracker is updated asynchronously by the detections from the laser and the camera data. Experiments conducted in real-world outdoor scenarios with crowds of pedestrians demonstrate the usefulness of our approach.
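The greedy data association step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `greedy_associate`, the use of plain Euclidean distance in 2D, and the `gate` threshold are all assumptions made for illustration, and the abstract does not say which distance measure or gating rule the paper uses. The sketch only shows the general idea of accepting the closest track-detection pairs first.

```python
import math


def greedy_associate(track_positions, detections, gate=1.0):
    """Greedily pair predicted track positions with detections.

    Hypothetical helper for illustration only. Candidate pairs are sorted
    by Euclidean distance (an assumption); the closest pair is accepted
    first, after which its track and its detection are no longer available.
    Pairs farther apart than `gate` are never matched.
    """
    candidates = []
    for t, (tx, ty) in enumerate(track_positions):
        for d, (dx, dy) in enumerate(detections):
            dist = math.hypot(tx - dx, ty - dy)
            if dist <= gate:
                candidates.append((dist, t, d))
    candidates.sort()  # closest pairs first

    matches = {}  # track index -> detection index
    used_detections = set()
    for _dist, t, d in candidates:
        if t in matches or d in used_detections:
            continue  # track or detection already assigned
        matches[t] = d
        used_detections.add(d)

    unmatched_tracks = [t for t in range(len(track_positions)) if t not in matches]
    unmatched_detections = [d for d in range(len(detections)) if d not in used_detections]
    return matches, unmatched_tracks, unmatched_detections


if __name__ == "__main__":
    tracks = [(0.0, 0.0), (5.0, 5.0)]
    dets = [(5.2, 5.1), (0.1, -0.1), (10.0, 10.0)]
    print(greedy_associate(tracks, dets, gate=1.0))
```

In a tracker of the kind described, matched detections would update the corresponding filters, unmatched detections could start new tracks, and unmatched tracks would continue on prediction alone. A greedy scheme is cheaper than an optimal assignment but can choose a worse overall pairing when people stand close together.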