2D Human Pose Estimation in TV Shows

Authors:
Vittorio Ferrari;Manuel Marín-Jiménez;Andrew Zisserman
Affiliations:
ETH Zurich,;University of Granada,;University of Oxford,
Venue:
Statistical and Geometrical Approaches to Visual Motion Analysis
Year:
2009

Citing 0
Cited 2

Discriminative Appearance Models for Pictorial Structures

International Journal of Computer Vision
People orientation recognition by mixtures of wrapped distributions on random trees

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V

Quantified Score

Hi-index	0.00

Visualization

Abstract

The goal of this work is fully automatic 2D human pose estimation in unconstrained TV shows and feature films. Direct pose estimation on this uncontrolled material is often too difficult, especially when knowing nothing about the location, scale, pose, and appearance of the person, or even whether there is a person in the frame or not.We propose an approach that progressively reduces the search space for body parts, to greatly facilitate the task for the pose estimator. Moreover, when video is available, we propose methods for exploiting the temporal continuity of both appearance and pose for improving the estimation based on individual frames.The method is fully automatic and self-initializing, and explains the spatio-temporal volume covered by a person moving in a shot by soft-labeling every pixel as belonging to a particular body part or to the background. We demonstrate upper-body pose estimation by running our system on four episodes of the TV series Buffy the vampire slayer (i.e. three hours of video). Our approach is evaluated quantitatively on several hundred video frames, based on ground-truth annotation of 2D poses. Finally, we present an application to full-body action recognition on the Weizmann dataset.