Patch-based pose inference with a mixture of density estimators

Authors:
David Demirdjian;Raquel Urtasun
Affiliations:
Computer Science and Artificial Intelligence Laboratory, Cambridge, MA;Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
Venue:
AMFG'07 Proceedings of the 3rd international conference on Analysis and modeling of faces and gestures
Year:
2007

Citing 12
Cited 1

Mean Shift, Mode Seeking, and Clustering

IEEE Transactions on Pattern Analysis and Machine Intelligence
Active Appearance Models

ECCV '98 Proceedings of the 5th European Conference on Computer Vision-Volume II - Volume II
Learning to Parse Pictures of People

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Object Recognition from Local Scale-Invariant Features

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Fast Pose Estimation with Parameter-Sensitive Hashing

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Learning to track 3D human motion from silhouettes

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Pictorial Structures for Object Recognition

International Journal of Computer Vision
Face Recognition Using Laplacianfaces

IEEE Transactions on Pattern Analysis and Machine Intelligence
Priors for People Tracking from Small Training Sets

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Kinematic jump processes for monocular 3D human tracking

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Nonparametric belief propagation

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
A local basis representation for estimating human pose from cluttered images

ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part I

Body-Part Templates for Recovery of 2D Human Poses under Occlusion

AMDO '08 Proceedings of the 5th international conference on Articulated Motion and Deformable Objects

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a patch-based approach for pose estimation from single images using a kernelized density voting scheme. We introduce a boosting-like algorithm that models the density using a mixture of weighted 'weak' estimators. The 'weak' density estimators and corresponding weights are learned iteratively from a training set, providing an efficient method for feature selection. Given a query image, voting is performed by reference patches similar in appearance to query image patches. Locality in the voting scheme allows us to handle occlusions and reduces the size of the training set required to cover the space of possible poses and appearance. Finally, the pose is estimated as the dominant mode in the density. Multimodality can be handled by looking at multiple dominant modes. Experiments carried out on face and articulated body pose databases show that our patch-based pose estimation algorithm generalizes well to unseen examples, is robust to occlusions and provides accurate pose estimation.