Human action recognition using a temporal hierarchy of covariance descriptors on 3D joint locations

  • Authors:
  • Mohamed E. Hussein;Marwan Torki;Mohammad A. Gowayyed;Motaz El-Saban

  • Affiliations:
  • Department of Computer and Systems Engineering, Alexandria University, Alexandria, Egypt;Department of Computer and Systems Engineering, Alexandria University, Alexandria, Egypt;Department of Computer and Systems Engineering, Alexandria University, Alexandria, Egypt;Microsoft Research Advanced Technology Lab Cairo, Cairo, Egypt

  • Venue:
  • IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Human action recognition from videos is a challenging machine vision task with multiple important application domains, such as human-robot/machine interaction, interactive entertainment, multimedia information retrieval, and surveillance. In this paper, we present a novel approach to human action recognition from 3D skeleton sequences extracted from depth data. We use the covariance matrix for skeleton joint locations over time as a discriminative descriptor for a sequence. To encode the relationship between joint movement and time, we deploy multiple covariance matrices over sub-sequences in a hierarchical fashion. The descriptor has a fixed length that is independent from the length of the described sequence. Our experiments show that using the covariance descriptor with an off-the-shelf classification algorithm outperforms the state of the art in action recognition on multiple datasets, captured either via a Kinect-type sensor or a sophisticated motion capture system. We also include an evaluation on a novel large dataset using our own annotation.