Mean Shift, Mode Seeking, and Clustering
IEEE Transactions on Pattern Analysis and Machine Intelligence
ECCV '98 Proceedings of the 5th European Conference on Computer Vision-Volume II - Volume II
Learning to Parse Pictures of People
ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Object Recognition from Local Scale-Invariant Features
ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Fast Pose Estimation with Parameter-Sensitive Hashing
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Learning to track 3D human motion from silhouettes
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Pictorial Structures for Object Recognition
International Journal of Computer Vision
Face Recognition Using Laplacianfaces
IEEE Transactions on Pattern Analysis and Machine Intelligence
Priors for People Tracking from Small Training Sets
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Kinematic jump processes for monocular 3D human tracking
CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Nonparametric belief propagation
CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
A local basis representation for estimating human pose from cluttered images
ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part I
Body-Part Templates for Recovery of 2D Human Poses under Occlusion
AMDO '08 Proceedings of the 5th international conference on Articulated Motion and Deformable Objects
Hi-index | 0.00 |
This paper presents a patch-based approach for pose estimation from single images using a kernelized density voting scheme. We introduce a boosting-like algorithm that models the density using a mixture of weighted 'weak' estimators. The 'weak' density estimators and corresponding weights are learned iteratively from a training set, providing an efficient method for feature selection. Given a query image, voting is performed by reference patches similar in appearance to query image patches. Locality in the voting scheme allows us to handle occlusions and reduces the size of the training set required to cover the space of possible poses and appearance. Finally, the pose is estimated as the dominant mode in the density. Multimodality can be handled by looking at multiple dominant modes. Experiments carried out on face and articulated body pose databases show that our patch-based pose estimation algorithm generalizes well to unseen examples, is robust to occlusions and provides accurate pose estimation.