Subject independent human action recognition using spatio-depth information and meta-cognitive RBF network

Authors:
R. Venkatesh Babu;R. Savitha;S. Suresh;Bhuvnesh Agarwal
Venue:
Engineering Applications of Artificial Intelligence
Year:
2013

Abstract

In this paper, we present a machine learning approach for subject-independent human action recognition using a depth camera, emphasizing the importance of depth in recognizing actions. The proposed approach uses flow information in all three dimensions to classify an action. We first compute the 2-D optical flow and then combine it with the depth image to obtain the depth flow (Z motion vectors). The resulting flow captures the dynamics of the actions in space-time. Feature vectors are obtained by averaging the 3-D motion over a grid laid over the silhouette in a hierarchical fashion. These hierarchical fine-to-coarse windows capture the motion dynamics of the object at various scales. The extracted features are used to train a Meta-cognitive Radial Basis Function Network (McRBFN) that uses a Projection Based Learning (PBL) algorithm, henceforth referred to as PBL-McRBFN. PBL-McRBFN begins with zero hidden neurons and builds the network based on the best human learning strategy, namely self-regulated learning in a meta-cognitive environment. When a sample is used for learning, PBL-McRBFN uses sample overlapping conditions and a projection based learning algorithm to estimate the parameters of the network. The performance of PBL-McRBFN is compared with that of Support Vector Machine (SVM) and Extreme Learning Machine (ELM) classifiers, with every person and action represented in both the training and testing datasets. The performance study shows that PBL-McRBFN outperforms these classifiers in recognizing actions in 3-D. Further, a subject-independent study is conducted using a leave-one-subject-out strategy to test generalization performance. This study shows that McRBFN is capable of generalizing actions accurately. The proposed approach is benchmarked on the Video Analytics Lab (VAL) dataset and the Berkeley Multimodal Human Action Database (MHAD).
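
The feature extraction described in the abstract (a depth flow derived from 2-D optical flow and depth images, then averaged over fine-to-coarse grids) can be illustrated with a minimal Python sketch. The abstract does not give the exact formulas, so everything here is an assumption made for illustration: the function names `depth_flow` and `grid_features` are hypothetical, the Z motion is taken to be the depth at the flow-displaced pixel in the next frame minus the depth at the original pixel, the grid sizes 4x4, 2x2 and 1x1 are arbitrary, and a rectangular bounding box stands in for the silhouette.

```python
def depth_flow(depth_prev, depth_next, flow_u, flow_v):
    """Estimate Z motion per pixel (assumed definition, not the paper's):
    depth at the flow-displaced position in the next frame minus
    depth at the original position in the previous frame."""
    h, w = len(depth_prev), len(depth_prev[0])
    flow_z = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Follow the 2-D flow vector, rounding and clamping to the image.
            nx = min(max(int(round(x + flow_u[y][x])), 0), w - 1)
            ny = min(max(int(round(y + flow_v[y][x])), 0), h - 1)
            flow_z[y][x] = depth_next[ny][nx] - depth_prev[y][x]
    return flow_z


def grid_features(flow_u, flow_v, flow_z, box, levels=(4, 2, 1)):
    """Average (u, v, z) motion over n x n grids laid on a bounding box,
    from fine to coarse. Grid sizes in `levels` are illustrative only."""
    top, left, bottom, right = box  # bottom and right are exclusive
    features = []
    for n in levels:
        for gy in range(n):
            y0 = top + (bottom - top) * gy // n
            y1 = top + (bottom - top) * (gy + 1) // n
            for gx in range(n):
                x0 = left + (right - left) * gx // n
                x1 = left + (right - left) * (gx + 1) // n
                count = max((y1 - y0) * (x1 - x0), 1)  # avoid division by zero
                for channel in (flow_u, flow_v, flow_z):
                    total = sum(channel[y][x]
                                for y in range(y0, y1)
                                for x in range(x0, x1))
                    features.append(total / count)
    return features
```

With the default levels, each frame pair yields (16 + 4 + 1) x 3 = 63 values, which could then be passed to any classifier. A real pipeline would obtain `flow_u` and `flow_v` from an optical flow estimator and would restrict the averages to silhouette pixels, neither of which is shown here.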