Latent gaussian mixture regression for human pose estimation

Authors:
Yan Tian;Leonid Sigal;Hernán Badino;Fernando De la Torre;Yong Liu
Affiliations:
Beijing University of Posts and Telecommunications, Beijing, P.R. China and Carnegie Mellon University, Pittsburgh;Disney Research, Pittsburgh;Carnegie Mellon University, Pittsburgh;Carnegie Mellon University, Pittsburgh;Beijing University of Posts and Telecommunications, Beijing, P.R. China
Venue:
ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part III
Year:
2010

Citing 11
Cited 1

Stochastic Tracking of 3D Human Figures Using 2D Image Motion

ECCV '00 Proceedings of the 6th European Conference on Computer Vision-Part II
Fast Pose Estimation with Parameter-Sensitive Hashing

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Discriminative Density Propagation for 3D Human Motion Estimation

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Articulated Pose Estimation in a Learned Smooth Space of Feasible Solutions

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops - Volume 03
Monocular Human Motion Capture with a Mixture of Regressors

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops - Volume 03
Learning Joint Top-Down and Bottom-up Processes for 3D Visual Inference

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Learning and Inference of 3D Human Poses from Gaussian Mixture Modeled Silhouettes

ICPR '06 Proceedings of the 18th International Conference on Pattern Recognition - Volume 02
Learning Generative Models for Multi-Activity Body Pose Estimation

International Journal of Computer Vision
Gaussian process latent variable models for human pose estimation

MLMI'07 Proceedings of the 4th international conference on Machine learning for multimodal interaction
Inferring 3D body pose from silhouettes using activity manifold learning

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
3D human pose from silhouettes by relevance vector regression

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition

Editor's choice article: Canonical locality preserving Latent Variable Model for discriminative pose inference

Image and Vision Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Discriminative approaches for human pose estimation model the functional mapping, or conditional distribution, between image features and 3D pose. Learning such multi-modal models in high dimensional spaces, however, is challenging with limited training data; often resulting in over-fitting and poor generalization. To address these issues latent variable models (LVMs) have been introduced. Shared LVMs attempt to learn a coherent, typically non-linear, latent space shared by image features and 3D poses, distribution of data in that latent space, and conditional distributions to and from this latent space to carry out inference. Discovering the shared manifold structure can, in itself, however, be challenging. In addition, shared LVMs models are most often non-parametric, requiring the model representation to be a function of the training set size. We present a parametric framework that addresses these shortcoming. In particular, we learn latent spaces, and distributions within them, for image features and 3D poses separately first, and then learn a multi-modal conditional density between these two lowdimensional spaces in the form of Gaussian Mixture Regression. Using our model we can address the issue of over-fitting and generalization, since the data is denser in the learned latent space, as well as avoid the necessity of learning a shared manifold for the data. We quantitatively evaluate and compare the performance of the proposed method to several state-of-the-art alternatives, and show that our method gives a competitive performance.