HumanEva: Synchronized Video and Motion Capture Dataset and Baseline Algorithm for Evaluation of Articulated Human Motion

  • Authors:
  • Leonid Sigal; Alexandru O. Balan; Michael J. Black

  • Affiliations:
  • Dept. of Computer Science, University of Toronto, Toronto, Canada M5S 3H5; Dept. of Computer Science, Brown University, Providence, USA 02912; Dept. of Computer Science, Brown University, Providence, USA 02912

  • Venue:
  • International Journal of Computer Vision
  • Year:
  • 2010

Abstract

While research on articulated human motion and pose estimation has progressed rapidly in the last few years, there has been no systematic quantitative evaluation of competing methods to establish the current state of the art. We present data obtained using a hardware system that is able to capture synchronized video and ground-truth 3D motion. The resulting HumanEva datasets contain multiple subjects performing a set of predefined actions with a number of repetitions. On the order of 40,000 frames of synchronized motion capture and multi-view video (resulting in over one quarter million image frames in total) were collected at 60 Hz with an additional 37,000 time instants of pure motion capture data. A standard set of error measures is defined for evaluating both 2D and 3D pose estimation and tracking algorithms. We also describe a baseline algorithm for 3D articulated tracking that uses a relatively standard Bayesian framework with optimization in the form of Sequential Importance Resampling and Annealed Particle Filtering. In the context of this baseline algorithm we explore a variety of likelihood functions, prior models of human motion and the effects of algorithm parameters. Our experiments suggest that image observation models and motion priors play important roles in performance, and that in a multi-view laboratory environment, where initialization is available, Bayesian filtering tends to perform well. The datasets and the software are made available to the research community. This infrastructure will support the development of new articulated motion and pose estimation algorithms, will provide a baseline for the evaluation and comparison of new methods, and will help establish the current state of the art in human pose estimation and tracking.
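As a rough illustration of the kind of error measure described in the abstract, the sketch below computes a mean per-joint Euclidean distance between estimated and ground-truth 3D joint (or virtual marker) positions, and averages it over a tracked sequence. The function names, array layout, and units are assumptions made for this example; it is a minimal sketch of the idea, not the released HumanEva evaluation software.

```python
import numpy as np

def mean_joint_error(est_joints, gt_joints):
    """Per-frame error: average Euclidean distance (e.g. in mm) between
    estimated and ground-truth 3D joint positions.

    Both inputs are assumed to be arrays of shape (J, 3), one row per joint.
    This is an illustrative stand-in for the error measure described in the
    abstract, not the official HumanEva evaluation code.
    """
    est = np.asarray(est_joints, dtype=float)
    gt = np.asarray(gt_joints, dtype=float)
    # Euclidean distance per joint, then average over the J joints.
    return float(np.mean(np.linalg.norm(est - gt, axis=1)))

def sequence_error(est_sequence, gt_sequence):
    """Sequence-level error: mean of the per-frame error over all frames."""
    per_frame = [mean_joint_error(e, g) for e, g in zip(est_sequence, gt_sequence)]
    return float(np.mean(per_frame))
```

A 2D variant follows the same pattern with (J, 2) arrays of image coordinates, so a single error definition can serve both the 2D and 3D evaluation settings mentioned in the abstract.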