Recovering 3D Human Pose from Monocular Images

Authors:
Ankur Agarwal;Bill Triggs
Affiliations:
-;-
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
2006

Citing 21
Cited 123

The nature of statistical learning theory

The nature of statistical learning theory
CONDENSATION—Conditional Density Propagation forVisual Tracking

International Journal of Computer Vision
Making large-scale support vector machine learning practical

Advances in kernel methods
Comparison of approximate methods for handling hyperparameters

Neural Computation
Neural Networks for Pattern Recognition

Neural Networks for Pattern Recognition
Shape Matching and Object Recognition Using Shape Contexts

IEEE Transactions on Pattern Analysis and Machine Intelligence
Hyperplane Approximation for Template Matching

IEEE Transactions on Pattern Analysis and Machine Intelligence
Implicit Probabilistic Models of Human Motion for Synthesis and Tracking

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part I
Estimating Human Body Configurations Using Shape Context Matching

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part III
Tracking People with Twists and Exponential Maps

CVPR '98 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Object Recognition from Local Scale-Invariant Features

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Shadow Puppetry

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
A Metric for Distributions with Applications to Image Databases

ICCV '98 Proceedings of the Sixth International Conference on Computer Vision
Sparse bayesian learning and the relevance vector machine

The Journal of Machine Learning Research
Inferring 3D Structure with a Statistical Image-Based Shape Model

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
A Sparse Probabilistic Learning Algorithm for Real-Time Tracking

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Filtering Using a Tree-Based Estimator

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Fast Pose Estimation with Parameter-Sensitive Hashing

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Learning to track 3D human motion from silhouettes

ICML '04 Proceedings of the twenty-first international conference on Machine learning
3D human pose from silhouettes by relevance vector regression

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Kinematic jump processes for monocular 3D human tracking

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition

Computational studies of human motion: part 1, tracking and motion synthesis

Foundations and Trends® in Computer Graphics and Vision
Human action recognition using star skeleton

Proceedings of the 4th ACM international workshop on Video surveillance and sensor networks
Character animation from 2D pictures and 3D motion data

ACM Transactions on Graphics (TOG)
A survey of advances in vision-based human motion capture and analysis

Computer Vision and Image Understanding - Special issue on modeling people: Vision-based understanding of a person's shape, appearance, movement, and behaviour
Viewpoint invariant exemplar-based 3D human tracking

Computer Vision and Image Understanding - Special issue on modeling people: Vision-based understanding of a person's shape, appearance, movement, and behaviour
Simultaneous gesture segmentation and recognition based on forward spotting accumulative HMMs

Pattern Recognition
Vision-based human motion analysis: An overview

Computer Vision and Image Understanding
BM3E: Discriminative Density Propagation for Visual Tracking

IEEE Transactions on Pattern Analysis and Machine Intelligence
Depth silhouettes for gesture recognition

Pattern Recognition Letters
Camera calibration from human motion

Image and Vision Computing
Generative tracking of 3D human motion by hierarchical annealed genetic algorithm

Pattern Recognition
Pose estimation and tracking using multivariate regression

Pattern Recognition Letters
A spatio-temporal 2D-models framework for human pose recovery in monocular sequences

Pattern Recognition
Human Motion Tracking with a Kinematic Parameterization of Extremal Contours

International Journal of Computer Vision
Simultaneous Segmentation and Pose Estimation of Humans Using Dynamic Graph Cuts

International Journal of Computer Vision
3D shape-encoded particle filter for object tracking and its application to human body tracking

Journal on Image and Video Processing - Anthropocentric Video Analysis: Tools and Applications
Using structured light for efficient depth edge detection

Image and Vision Computing
Staying Well Grounded in Markerless Motion Capture

Proceedings of the 30th DAGM symposium on Pattern Recognition
Body-Part Templates for Recovery of 2D Human Poses under Occlusion

AMDO '08 Proceedings of the 5th international conference on Articulated Motion and Deformable Objects
A 3D Shape Descriptor for Human Pose Recovery

AMDO '08 Proceedings of the 5th international conference on Articulated Motion and Deformable Objects
Ambiguity Modeling in Latent Spaces

MLMI '08 Proceedings of the 5th international workshop on Machine Learning for Multimodal Interaction
Monocular 3D tracking of articulated human motion in silhouette and pose manifolds

Journal on Image and Video Processing - Anthropocentric Video Analysis: Tools and Applications
People tracking and segmentation using spatiotemporal shape constraints

VNBA '08 Proceedings of the 1st ACM workshop on Vision networks for behavior analysis
On Bin Configuration of Shape Context Descriptors in Human Silhouette Classification

ACIVS '08 Proceedings of the 10th International Conference on Advanced Concepts for Intelligent Vision Systems
Fast nonparametric belief propagation for real-time stereo articulated body tracking

Computer Vision and Image Understanding
Tracking articulated objects by learning intrinsic structure of motion

Pattern Recognition Letters
Enhancing a Sign Language Translation System with Vision-Based Features

Gesture-Based Human-Computer Interaction and Simulation
Recovery of upper body poses in static images based on joints detection

Pattern Recognition Letters
A Single Camera Motion Capture System for Human-Computer Interaction

IEICE - Transactions on Information and Systems
Multimodal Human Machine Interactions in Virtual and Augmented Reality

Multimodal Signals: Cognitive and Algorithmic Issues
Monitoring Activities of Daily Living (ADLs) of Elderly Based on 3D Key Human Postures

Cognitive Vision
Video synchronization from human motion using rank constraints

Computer Vision and Image Understanding
Region-Based vs. Edge-Based Registration for 3D Motion Capture by Real Time Monoscopic Vision

MIRAGE '09 Proceedings of the 4th International Conference on Computer Vision/Computer Graphics CollaborationTechniques
Action-specific motion prior for efficient Bayesian 3D human body tracking

Pattern Recognition
Covariate Analysis for View-Point Independent Gait Recognition

ICB '09 Proceedings of the Third International Conference on Advances in Biometrics
Action recognition feedback-based framework for human pose reconstruction from monocular images

Pattern Recognition Letters
3D Human Pose Estimation from Static Images Using Local Features and Discriminative Learning

ICIAR '09 Proceedings of the 6th International Conference on Image Analysis and Recognition
Leveraging the talent of hand animators to create three-dimensional animation

Proceedings of the 2009 ACM SIGGRAPH/Eurographics Symposium on Computer Animation
Self-Organizing Maps for Pose Estimation with a Time-of-Flight Camera

Dyn3D '09 Proceedings of the DAGM 2009 Workshop on Dynamic 3D Imaging
Single-Frame 3D Human Pose Recovery from Multiple Views

Proceedings of the 31st DAGM Symposium on Pattern Recognition
Vision-based human pose estimation for pervasive computing

AMC '09 Proceedings of the 2009 workshop on Ambient media computing
Machine Vision Application to Automatic Intruder Detection Using CCTV

KES '09 Proceedings of the 13th International Conference on Knowledge-Based and Intelligent Information and Engineering Systems: Part II
3D Human Body Tracking in Unconstrained Scenes

PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Human Pose Tracking Using Motion-Based Search

PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
A vision-based architecture for long-term human-robot interaction

IASTED-HCI '07 Proceedings of the Second IASTED International Conference on Human Computer Interaction
Human pose estimation from monocular image captures

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Twin Gaussian Processes for Structured Prediction

International Journal of Computer Vision
A Study of Parts-Based Object Class Detection Using Complete Graphs

International Journal of Computer Vision
Optimization and Filtering for Human Motion Capture

International Journal of Computer Vision
Physics-Based Person Tracking Using the Anthropomorphic Walker

International Journal of Computer Vision
Silhouette representation and matching for 3D pose discrimination - A comparative study

Image and Vision Computing
A variational approach to monocular hand-pose estimation

Computer Vision and Image Understanding
Discriminative human action recognition in the learned hierarchical manifold space

Image and Vision Computing
Occlusion modeling by tracking multiple objects

Proceedings of the 29th DAGM conference on Pattern recognition
Shared latent dynamical model for human tracking from videos

MCAM'07 Proceedings of the 2007 international conference on Multimedia content analysis and mining
VideoMocap: modeling physically realistic human motion from monocular video sequences

ACM SIGGRAPH 2010 papers
Capturing 3D human motion from monocular images using orthogonal locality preserving projection

ICDHM'07 Proceedings of the 1st international conference on Digital human modeling
Reconstruct 3D human motion from monocular video using motion library

MMM'08 Proceedings of the 14th international conference on Advances in multimedia modeling
Gaussian process latent variable models for human pose estimation

MLMI'07 Proceedings of the 4th international conference on Machine learning for multimodal interaction
Advances in view-invariant human motion analysis: a review

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Dual gait generative models for human motion estimation from a single camera

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on gait analysis
A shape descriptor for real time 3D foot pose estimation

ISCGAV'10 Proceedings of the 10th WSEAS international conference on Signal processing, computational geometry and artificial vision
MovieReshape: tracking and reshaping of humans in videos

ACM SIGGRAPH Asia 2010 papers
Kinematic self retargeting: A framework for human pose estimation

Computer Vision and Image Understanding
Multiple-activity human body tracking in unconstrained environments

AMDO'10 Proceedings of the 6th international conference on Articulated motion and deformable objects
Self-occlusion handling for human body motion tracking from 3D ToF image sequence

Proceedings of the 1st international workshop on 3D video processing
We are family: joint pose estimation of multiple persons

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Human posture recognition for intelligent vehicles

Journal of Real-Time Image Processing
Augmenting hand animation with three-dimensional secondary motion

Proceedings of the 2010 ACM SIGGRAPH/Eurographics Symposium on Computer Animation
2D action recognition serves 3D human pose estimation

ECCV'10 Proceedings of the 11th European conference on computer vision conference on Computer vision: Part III
A two-stage Bayesian network method for 3D human pose estimation from monocular image sequences

EURASIP Journal on Advances in Signal Processing - Special issue on video analysis for human behavior understanding
A survey of vision-based methods for action representation, segmentation and recognition

Computer Vision and Image Understanding
Integration of bottom-up/top-down approaches for 2D pose estimation using probabilistic Gaussian modelling

Computer Vision and Image Understanding
3D human pose recovery from image by efficient visual feature selection

Computer Vision and Image Understanding
Human pose estimation using exemplars and part based refinement

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part II
Feature-Driven Direct Non-Rigid Image Registration

International Journal of Computer Vision
Face sketch synthesis via multivariate output regression

HCII'11 Proceedings of the 14th international conference on Human-computer interaction: design and development approaches - Volume Part I
Recognizing multiple human activities and tracking full-body pose in unconstrained environments

Pattern Recognition
Estimation of human orientation in images captured with a range camera

ACIVS'11 Proceedings of the 13th international conference on Advanced concepts for intelligent vision systems
Upper Body Detection and Tracking in Extended Signing Sequences

International Journal of Computer Vision
Estimation of 3-D human body posture via co-registration of 3-D human model and sequential stereo information

Applied Intelligence
Efficient and robust shape matching for model based human motion capture

DAGM'11 Proceedings of the 33rd international conference on Pattern recognition
Three-dimensional proxies for hand-drawn characters

ACM Transactions on Graphics (TOG)
Nonparametric density estimation for human pose tracking

DAGM'06 Proceedings of the 28th conference on Pattern Recognition
Skin colour segmentation based 2D and 3D human pose modelling using Discrete Wavelet Transform

Pattern Recognition and Image Analysis
Multi-view human movement recognition based on fuzzy distances and linear discriminant analysis

Computer Vision and Image Understanding
Multi-view 3D Human Pose Estimation in Complex Environment

International Journal of Computer Vision
Towards robust 3d reconstruction of human motion from monocular video

ICAT'06 Proceedings of the 16th international conference on Advances in Artificial Reality and Tele-Existence
Multiple people tracking and pose estimation with occlusion estimation

Computer Vision and Image Understanding
Estimating human pose from occluded images

ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part I
Temporal-Spatial local gaussian process experts for human pose estimation

ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part I
Discriminative human full-body pose estimation from wearable inertial sensor data

3DPH'09 Proceedings of the 2009 international conference on Modelling the Physiological Human
Dynamic kernel-based progressive particle filter for 3d human motion tracking

ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part II
Human body pose estimation from still images and video frames

ICIAR'10 Proceedings of the 7th international conference on Image Analysis and Recognition - Volume Part I
Human motion tracking with monocular video by introducing a graph structure into gaussian process dynamical models

PSIVT'11 Proceedings of the 5th Pacific Rim conference on Advances in Image and Video Technology - Volume Part I
Loose-limbed People: Estimating 3D Human Pose and Motion Using Non-parametric Belief Propagation

International Journal of Computer Vision
3D hand tracking for human computer interaction

Image and Vision Computing
Globally Optimal Estimation of Nonrigid Image Distortion

International Journal of Computer Vision
Fast Human Pose Detection Using Randomized Hierarchical Cascades of Rejectors

International Journal of Computer Vision
A Self-Training Approach for Visual Tracking and Recognition of Complex Human Activity Patterns

International Journal of Computer Vision
Graph based semi-supervised human pose estimation: When the output space comes to help

Pattern Recognition Letters
Coupled Action Recognition and Pose Estimation from Multiple Views

International Journal of Computer Vision
Full body motion tracking in monocular images using particle swarm optimization

ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part I
Human typical action recognition using gray scale image of silhouette sequence

Computers and Electrical Engineering
State of the Art Report on Video-Based Graphics and Video Visualization

Computer Graphics Forum
Topic based pose relevance learning in dance archives

Proceedings of the 21st ACM international conference on Information and knowledge management
Adaptive occlusion state estimation for human pose tracking under self-occlusions

Pattern Recognition
Efficient articulated trajectory reconstruction using dynamic programming and filters

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part I
Action recognition using subtensor constraint

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Consensus algorithms in a multi-agent framework to solve PTZ camera reconfiguration in UAVs

ICIRA'12 Proceedings of the 5th international conference on Intelligent Robotics and Applications - Volume Part I
No bias left behind: covariate shift adaptation for discriminative 3d pose estimation

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part IV
Part template: 3D representation for multiview human pose estimation

Pattern Recognition
Editor's choice article: Canonical locality preserving Latent Variable Model for discriminative pose inference

Image and Vision Computing
Hierarchical conditional random fields for myocardium infarction detection

STACOM'12 Proceedings of the third international conference on Statistical Atlases and Computational Models of the Heart: imaging and modelling challenges
Two-layer dual gait generative models for human motion estimation from a single camera

Image and Vision Computing
Non-parametric hand pose estimation with object context

Image and Vision Computing
Discriminative fusion of shape and appearance features for human pose estimation

Pattern Recognition
A new hierarchical method for markerless human pose estimation

ICVS'13 Proceedings of the 9th international conference on Computer Vision Systems
Data-driven suggestions for portrait posing

SIGGRAPH Asia 2013 Technical Briefs
Mixtures of Gaussian process models for human pose estimation

Image and Vision Computing
Exploiting projective geometry for view-invariant monocular human motion analysis in man-made environments

Computer Vision and Image Understanding
Generative tracking of 3D human motion in latent space by sequential clonal selection algorithm

Multimedia Tools and Applications
Combination of annealing particle filter and belief propagation for 3D upper body tracking

Applied Bionics and Biomechanics - Personal Care Robotics

Quantified Score

Hi-index	0.14

Visualization

Abstract

We describe a learning-based method for recovering 3D human body pose from single images and monocular image sequences. Our approach requires neither an explicit body model nor prior labeling of body parts in the image. Instead, it recovers pose by direct nonlinear regression against shape descriptor vectors extracted automatically from image silhouettes. For robustness against local silhouette segmentation errors, silhouette shape is encoded by histogram-of-shape-contexts descriptors. We evaluate several different regression methods: ridge regression, Relevance Vector Machine (RVM) regression, and Support Vector Machine (SVM) regression over both linear and kernel bases. The RVMs provide much sparser regressors without compromising performance, and kernel bases give a small but worthwhile improvement in performance. The loss of depth and limb labeling information often makes the recovery of 3D pose from single silhouettes ambiguous. To handle this, the method is embedded in a novel regressive tracking framework, using dynamics from the previous state estimate together with a learned regression value to disambiguate the pose. We show that the resulting system tracks long sequences stably. For realism and good generalization over a wide range of viewpoints, we train the regressors on images resynthesized from real human motion capture data. The method is demonstrated for several representations of full body pose, both quantitatively on independent but similar test data and qualitatively on real image sequences. Mean angular errors of 4{\hbox{-}}6^\circ are obtained for a variety of walking motions.