Multiple cue integrated action detection

Authors:
Sang-Hack Jung;Yanlin Guo;Harpreet Sawhney;Rakesh Kumar
Affiliations:
Sarnoff Corporation, Princeton, NJ;Sarnoff Corporation, Princeton, NJ;Sarnoff Corporation, Princeton, NJ;Sarnoff Corporation, Princeton, NJ
Venue:
HCI'07 Proceedings of the 2007 IEEE international conference on Human-computer interaction
Year:
2007

Citing 9
Cited 1

The Design and Use of Steerable Filters

IEEE Transactions on Pattern Analysis and Machine Intelligence
Elliptical Head Tracking Using Intensity Gradients and Color Histograms

CVPR '98 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Detecting Pedestrians Using Patterns of Motion and Appearance

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Recognizing Human Actions: A Local SVM Approach

ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 3 - Volume 03
Space-Time Behavior Based Correlation

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Histograms of Oriented Gradients for Human Detection

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Unsupervised Learning of Discriminative Edge Measures for Vehicle Matching between Non-Overlapping Cameras

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Efficient Visual Event Detection Using Volumetric Features

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Learning Exemplar-Based Categorization for the Detection of Multi-View Multi-Pose Objects

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2

Human-computer intelligent interaction: a survey

HCI'07 Proceedings of the 2007 IEEE international conference on Human-computer interaction

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present an action recognition scheme that integrates multiple modality of cues that include shape, motion and depth to recognize human gesture in the video sequences. In the proposed approach we extend classification framework that is commonly used in 2D object recognition to 3D spatio-temporal space for recognizing actions. Specifically, a boosting-based classifier is used that learns spatio-temporal features specific to target actions where features are obtained from temporal patterns of shape contour, optical flow and depth changes occuring at local body parts. The individual features exhibit different strength and sensitivity depending on many factors that include action, underlying body parts and background. In the current method, the multiple cues of different modalities are combined optimally by fisher linear discriminant to form a strong feature that preserve strength of individual cues. In the experiment, we apply the integrated action classifier on a set of target actions and evaluate its performance by comparing with single cue-based cases and present qualitative analysis of performance gain.