Estimating human pose from occluded images

Authors:
Jia-Bin Huang;Ming-Hsuan Yang
Affiliations:
Electrical Engineering and Computer Science, University of California at Merced;Electrical Engineering and Computer Science, University of California at Merced
Venue:
ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part I
Year:
2009

Citing 26
Cited 0

Atomic Decomposition by Basis Pursuit

SIAM Journal on Scientific Computing
The visual analysis of human movement: a survey

Computer Vision and Image Understanding
A survey of computer vision-based human motion capture

Computer Vision and Image Understanding - Modeling people toward vision-based underatanding of a person's shape, appearance, and movement
Probabilistic Methods for Finding People

International Journal of Computer Vision
Learning to Parse Pictures of People

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Estimating Human Body Configurations Using Shape Context Matching

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part III
Shadow Puppetry

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Sparse bayesian learning and the relevance vector machine

The Journal of Machine Learning Research
Inferring 3D Structure with a Statistical Image-Based Shape Model

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Fast Pose Estimation with Parameter-Sensitive Hashing

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Convex Optimization

Convex Optimization
Discriminative Density Propagation for 3D Human Motion Estimation

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Recovering 3D Human Pose from Monocular Images

IEEE Transactions on Pattern Analysis and Machine Intelligence
Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning)

Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning)
The Representation and Matching of Pictorial Structures

IEEE Transactions on Computers
BM3E: Discriminative Density Propagation for Visual Tracking

IEEE Transactions on Pattern Analysis and Machine Intelligence
Relevant Feature Selection for Human Pose Estimation and Localization in Cluttered Images

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part II
Robust Face Recognition via Sparse Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning Generative Models for Multi-Activity Body Pose Estimation

International Journal of Computer Vision
Recovering human body configurations: combining segmentation and recognition

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Inferring 3D body pose from silhouettes using activity manifold learning

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Predicting 3d people from 2d pictures

AMDO'06 Proceedings of the 4th international conference on Articulated Motion and Deformable Objects
A local basis representation for estimating human pose from cluttered images

ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part I
Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information

IEEE Transactions on Information Theory
Compressed sensing

IEEE Transactions on Information Theory
Near-Optimal Signal Recovery From Random Projections: Universal Encoding Strategies?

IEEE Transactions on Information Theory

Quantified Score

Hi-index	0.00

Visualization

Abstract

We address the problem of recovering 3D human pose from single 2D images, in which the pose estimation problem is formulated as a direct nonlinear regression from image observation to 3D joint positions. One key issue that has not been addressed in the literature is how to estimate 3D pose when humans in the scenes are partially or heavily occluded. When occlusions occur, features extracted from image observations (e.g., silhouettes-based shape features, histogram of oriented gradient, etc.) are seriously corrupted, and consequently the regressor (trained on un-occluded images) is unable to estimate pose states correctly. In this paper, we present a method that is capable of handling occlusions using sparse signal representations, in which each test sample is represented as a compact linear combination of training samples. The sparsest solution can then be efficiently obtained by solving a convex optimization problem with certain norms (such as l1-norm). The corrupted test image can be recovered with a sparse linear combination of un-occluded training images which can then be used for estimating human pose correctly (as if no occlusions exist). We also show that the proposed approach implicitly performs relevant feature selection with un-occluded test images. Experimental results on synthetic and real data sets bear out our theory that with sparse representation 3D human pose can be robustly estimated when humans are partially or heavily occluded in the scenes.