Virtual worlds and active learning for human detection

Authors:
David Vázquez;Antonio M. López;Daniel Ponsa;Javier Marín
Affiliations:
Computer Vision Center and Computer Science Dpt. UAB, Bellaterra, Spain;Computer Vision Center and Computer Science Dpt. UAB, Bellaterra, Spain;Computer Vision Center and Computer Science Dpt. UAB, Bellaterra, Spain;Computer Vision Center, Bellaterra, Spain
Venue:
ICMI '11 Proceedings of the 13th international conference on multimodal interfaces
Year:
2011

Abstract

Image-based human detection is of paramount interest because of its potential applications in fields such as advanced driver assistance, surveillance and media analysis. However, even detecting non-occluded standing humans remains a challenge under intensive research. The most promising human detectors rely on classifiers developed in the discriminative paradigm, i.e. trained with labelled samples. However, labelling is a labour-intensive manual step, especially in cases like human detection, where at least a bounding box framing each human must be provided for training. To overcome this problem, some authors have proposed the use of a virtual world in which the labels of the different objects are obtained automatically. This means that the human models (classifiers) are learnt from the appearance of rendered images, i.e. from realistic computer graphics. Later, these models are used for human detection in real-world images. The results of this technique are surprisingly good. However, they are not always as good as those of the classical approach of training and testing with data coming from the same camera, or similar ones. Accordingly, in this paper we address the challenge of using a virtual world to gather (while playing a videogame) a large number of automatically labelled samples (virtual humans and background) and then training a classifier that performs as well on real-world images as one trained in the same way on manually labelled real-world samples. To do so, we cast the problem as one of domain adaptation. In doing so, we assume that a small number of manually labelled samples from real-world images is required. To collect these labelled samples we propose a non-standard active learning technique. Ultimately, therefore, our human model is learnt from a combination of virtual-world and real-world labelled samples, which has not been done before.