Coupled Action Recognition and Pose Estimation from Multiple Views

  • Authors:
  • Angela Yao;Juergen Gall;Luc Gool

  • Affiliations:
  • Computer Vision Laboratory, ETH Zurich, Zurich, Switzerland 8092;Computer Vision Laboratory, ETH Zurich, Zurich, Switzerland 8092 and Max Planck Institute for Intelligent Systems, Tubingen, Germany 72076;Computer Vision Laboratory, ETH Zurich, Zurich, Switzerland 8092 and Department of Electrical Engineering/IBBT, K.U. Leuven, Heverlee, Belgium 3001

  • Venue:
  • International Journal of Computer Vision
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Action recognition and pose estimation are two closely related topics in understanding human body movements; information from one task can be leveraged to assist the other, yet the two are often treated separately. We present here a framework for coupled action recognition and pose estimation by formulating pose estimation as an optimization over a set of action-specific manifolds. The framework allows for integration of a 2D appearance-based action recognition system as a prior for 3D pose estimation and for refinement of the action labels using relational pose features based on the extracted 3D poses. Our experiments show that our pose estimation system is able to estimate body poses with high degrees of freedom using very few particles and can achieve state-of-the-art results on the HumanEva-II benchmark. We also thoroughly investigate the impact of pose estimation and action recognition accuracy on each other on the challenging TUM kitchen dataset. We demonstrate not only the feasibility of using extracted 3D poses for action recognition, but also improved performance in comparison to action recognition using low-level appearance features.