Human action recognition based on graph-embedded spatio-temporal subspace

  • Authors:
  • Chien-Chung Tseng; Ju-Chin Chen; Ching-Hsien Fang; Jenn-Jier James Lien

  • Affiliations:
  • Department of Computer Science and Information Engineering, National Cheng Kung University, Tainan 70101, Taiwan, ROC (Tseng, Fang, Lien); Department of Computer Science and Information Engineering, National Kaohsiung University of Applied Sciences, Kaohsiung 80778, Taiwan, ROC (Chen)

  • Venue:
  • Pattern Recognition
  • Year:
  • 2012

Abstract

Human action recognition is an important problem in pattern recognition, with applications ranging from remote surveillance to the indexing of commercial video content. However, human actions are characterized by non-linear dynamics and are therefore not easily learned and recognized. Accordingly, this study proposes a silhouette-based human action recognition system in which a three-step procedure is used to construct an efficient discriminant spatio-temporal subspace for k-NN classification. In the first step, an Adaptive Locality Preserving Projection (ALPP) method is proposed to obtain a low-dimensional spatial subspace in which the linearity of the local data structure is preserved. To resolve overlaps in the spatial subspace caused by the ambiguity of the human body shape among different action classes, temporal information is extracted using a Non-base Central-Difference Action Vector (NCDAV) method. Finally, the Large Margin Nearest Neighbor (LMNN) metric learning method is applied to construct an efficient spatio-temporal subspace for classification. The experimental results show that the proposed system accurately recognizes a variety of human actions in real time and outperforms most existing methods. In addition, a robustness test with noisy data indicates that the system is robust to noise in the input images.
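
The sketch below illustrates the overall three-step structure of the pipeline described in the abstract, using standard stand-ins rather than the paper's own components: a plain Locality Preserving Projection (LPP) in place of the proposed ALPP, simple central differences of projected frames in place of NCDAV, and Euclidean k-NN in place of an LMNN-learned metric. All function names, parameters, and data shapes here are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of the spatial-projection -> temporal-difference -> k-NN pipeline.
# Stand-ins only: standard LPP (not ALPP), plain central differences (not NCDAV),
# Euclidean k-NN (not LMNN). Names and parameters are hypothetical.
import numpy as np
from scipy.linalg import eigh
from sklearn.neighbors import NearestNeighbors, KNeighborsClassifier


def lpp(X, n_components=10, n_neighbors=5, t=1.0):
    """Standard LPP: X is (n_samples, n_features); returns a projection matrix."""
    nn = NearestNeighbors(n_neighbors=n_neighbors + 1).fit(X)
    dist, idx = nn.kneighbors(X)
    n = X.shape[0]
    W = np.zeros((n, n))
    for i in range(n):
        for j, d in zip(idx[i, 1:], dist[i, 1:]):   # skip the self-neighbor
            w = np.exp(-d ** 2 / t)                 # heat-kernel affinity
            W[i, j] = W[j, i] = max(W[i, j], w)
    D = np.diag(W.sum(axis=1))
    L = D - W                                       # graph Laplacian
    A = X.T @ L @ X
    B = X.T @ D @ X + 1e-6 * np.eye(X.shape[1])     # regularize for stability
    vals, vecs = eigh(A, B)                         # eigenvalues in ascending order
    return vecs[:, :n_components]                   # smallest eigenvectors span the subspace


def temporal_features(Y):
    """Central differences of projected frames Y (n_frames, n_components),
    a rough stand-in for the paper's NCDAV temporal descriptor."""
    return Y[2:] - Y[:-2]


# Usage with random data standing in for per-frame silhouette feature vectors.
rng = np.random.default_rng(0)
frames = rng.normal(size=(200, 64))           # 200 frames, 64-dim silhouette features
labels = rng.integers(0, 3, size=198)         # per-frame action labels (after differencing)

P = lpp(frames, n_components=10)              # step 1: low-dimensional spatial subspace
Z = temporal_features(frames @ P)             # step 2: temporal descriptors
clf = KNeighborsClassifier(n_neighbors=3)     # step 3: k-NN (Euclidean here; LMNN in the paper)
clf.fit(Z, labels)
print(clf.score(Z, labels))
```

In the actual system, the LMNN step would replace the Euclidean distance in the final k-NN classifier with a learned Mahalanobis metric, which is what turns the concatenated spatial and temporal features into the discriminant spatio-temporal subspace the abstract refers to.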