Part template: 3D representation for multiview human pose estimation

Authors:
Jianfeng Shen;Wenming Yang;Qingmin Liao
Affiliations:
Tsinghua University, Tsinghua Campus, Shenzhen 518055, China;Tsinghua University, Tsinghua Campus, Shenzhen 518055, China;Tsinghua University, Tsinghua Campus, Shenzhen 518055, China
Venue:
Pattern Recognition
Year:
2013

Citing 29
Cited 1

Integrated Person Tracking Using Stereo, Color, and Pattern Detection

International Journal of Computer Vision - Special issue on a special section on visual surveillance
Model-Based Estimation of 3D Human Motion

IEEE Transactions on Pattern Analysis and Machine Intelligence
Efficient greedy learning of Gaussian mixture models

Neural Computation
Real-Time Tracking of Articulated Human Models Using a 3D Shape-from-Silhouette Method

RobVis '01 Proceedings of the International Workshop on Robot Vision
3-D model-based tracking of humans in action: a multi-view approach

CVPR '96 Proceedings of the 1996 Conference on Computer Vision and Pattern Recognition (CVPR '96)
Tracking People with Twists and Exponential Maps

CVPR '98 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Tracking a Person with 3-D Motion by Integrating Optical Flow and Depth

FG '00 Proceedings of the Fourth IEEE International Conference on Automatic Face and Gesture Recognition 2000
Automated Body Modeling from Video Sequences

MPEOPLE '99 Proceedings of the IEEE International Workshop on Modelling People
A Model Driven 3D Image Interpretation System Applied to Person Detection in Video Images

ICPR '98 Proceedings of the 14th International Conference on Pattern Recognition-Volume 1 - Volume 1
Articulated Body Motion Capture by Stochastic Search

International Journal of Computer Vision
Full Body Tracking from Multiple Views Using Stochastic Sampling

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Recovering 3D Human Pose from Monocular Images

IEEE Transactions on Pattern Analysis and Machine Intelligence
A survey of advances in vision-based human motion capture and analysis

Computer Vision and Image Understanding - Special issue on modeling people: Vision-based understanding of a person's shape, appearance, movement, and behaviour
Temporal motion models for monocular and multiview 3D human body tracking

Computer Vision and Image Understanding - Special issue on modeling people: Vision-based understanding of a person's shape, appearance, movement, and behaviour
3D Skeleton-Based Body Pose Recovery

3DPVT '06 Proceedings of the Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06)
Curve-Skeleton Properties, Applications, and Algorithms

IEEE Transactions on Visualization and Computer Graphics
Vision-based human motion analysis: An overview

Computer Vision and Image Understanding
Model Driven Segmentation of Articulating Humans in Laplacian Eigenspace

IEEE Transactions on Pattern Analysis and Machine Intelligence
Exploiting motion correlations in 3-D articulated human motion tracking

IEEE Transactions on Image Processing
HumanEva: Synchronized Video and Motion Capture Dataset and Baseline Algorithm for Evaluation of Articulated Human Motion

International Journal of Computer Vision
Towards a low cost multi-camera marker based human motion capture system

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
3D human motion tracking based on a progressive particle filter

Pattern Recognition
Action and gait recognition from recovered 3-D human joints

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on gait analysis
Human tracking using convolutional neural networks

IEEE Transactions on Neural Networks
Integration of bottom-up/top-down approaches for 2D pose estimation using probabilistic Gaussian modelling

Computer Vision and Image Understanding
3D human pose recovery from image by efficient visual feature selection

Computer Vision and Image Understanding
Estimating complicated and overlapped human body postures by wearing a multiple-colored suit using color information processing

FGR' 04 Proceedings of the Sixth IEEE international conference on Automatic face and gesture recognition
Shape-from-silhouette of articulated objects and its use for human body kinematics estimation and motion capture

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition
Multi-view 3D Human Pose Estimation in Complex Environment

International Journal of Computer Vision

Principal direction analysis-based real-time 3D human pose reconstruction from a single depth image

Proceedings of the Fourth Symposium on Information and Communication Technology

Quantified Score

Hi-index	0.01

Visualization

Abstract

We present a system for human pose estimation from synchronized multiview images. The system uses an analysis-by-synthesis approach with a skeleton model. This approach is powerful, but may present issues with its potentially huge search space. We adopt a hierarchical method where the head and torso are found first based on template fitting. The detection of the other parts then proceeds with the shoulders and hips to locate the anchor points of the limbs. Subsequently, a hierarchical fitting technique is used to estimate the location of the limbs. The parameter space is then partitioned, which dramatically reduces the complexity of pose estimation. Another difficulty of this system is to find adequate measurements which are used to fit the skeleton model. A multi-cue 3D fusion method is proposed for this purpose. It starts with extracting a set of cues from synchronized multiview images which exploit geometric and color information, and they are then integrated into a 3D representation, called a ''part template''. The experiments show that this system reliably performs on sequences that include unconstrained motions, such as those that are fast or unpredictable, and is also robust to several common issues associated with input data, such as image noise and self-contact.