Multiple cue integrated action detection

  • Authors:
  • Sang-Hack Jung;Yanlin Guo;Harpreet Sawhney;Rakesh Kumar

  • Affiliations:
  • Sarnoff Corporation, Princeton, NJ;Sarnoff Corporation, Princeton, NJ;Sarnoff Corporation, Princeton, NJ;Sarnoff Corporation, Princeton, NJ

  • Venue:
  • HCI'07 Proceedings of the 2007 IEEE international conference on Human-computer interaction
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present an action recognition scheme that integrates multiple modality of cues that include shape, motion and depth to recognize human gesture in the video sequences. In the proposed approach we extend classification framework that is commonly used in 2D object recognition to 3D spatio-temporal space for recognizing actions. Specifically, a boosting-based classifier is used that learns spatio-temporal features specific to target actions where features are obtained from temporal patterns of shape contour, optical flow and depth changes occuring at local body parts. The individual features exhibit different strength and sensitivity depending on many factors that include action, underlying body parts and background. In the current method, the multiple cues of different modalities are combined optimally by fisher linear discriminant to form a strong feature that preserve strength of individual cues. In the experiment, we apply the integrated action classifier on a set of target actions and evaluate its performance by comparing with single cue-based cases and present qualitative analysis of performance gain.