Action recognition in video by sparse representation on covariance manifolds of silhouette tunnels

  • Authors:
  • Kai Guo, Prakash Ishwar, Janusz Konrad

  • Affiliations:
  • Department of Electrical and Computer Engineering, Boston University, Saint Mary's St., Boston, MA (all authors)

  • Venue:
  • ICPR'10: Proceedings of the 20th International Conference on Recognizing Patterns in Signals, Speech, Images, and Videos
  • Year:
  • 2010

Abstract

A novel framework for action recognition in video using empirical covariance matrices of bags of low-dimensional feature vectors is developed. The feature vectors are extracted from segments of silhouette tunnels of moving objects and coarsely capture their shapes. The matrix logarithm is used to map the segment covariance matrices, which live in a nonlinear Riemannian manifold, to the vector space of symmetric matrices. A recently developed sparse linear representation framework for dictionary-based classification is then applied to the log-covariance matrices. The log-covariance matrix of a query segment is approximated by a sparse linear combination of the log-covariance matrices of training segments and the sparse coefficients are used to determine the action label of the query segment. This approach is tested on the Weizmann and the UT-Tower human action datasets. The new approach attains a segment-level classification rate of 96.74% for the Weizmann dataset and 96.15% for the UT-Tower dataset. Additionally, the proposed method is computationally and memory efficient and easy to implement.
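The pipeline described in the abstract — empirical covariance of a bag of feature vectors, matrix logarithm to leave the Riemannian manifold, then dictionary-based sparse coding over the training log-covariances — can be sketched as follows. This is a minimal illustration, not the authors' implementation: the silhouette-tunnel feature extraction is omitted (synthetic features stand in for it), the sparse solver is a simple greedy orthogonal matching pursuit rather than whatever solver the paper uses, and all function names (`log_cov`, `omp`, `classify`) are placeholders.

```python
import numpy as np

def log_cov(features):
    """Empirical covariance of a bag of feature vectors, mapped via the
    matrix logarithm to the vector space of symmetric matrices.
    features: (n_samples, dim) array standing in for per-segment features."""
    C = np.cov(features, rowvar=False)
    C += 1e-6 * np.eye(C.shape[0])          # regularize for positive definiteness
    w, V = np.linalg.eigh(C)                # symmetric eigendecomposition
    L = V @ np.diag(np.log(w)) @ V.T        # matrix logarithm
    iu = np.triu_indices(L.shape[0])        # symmetric -> keep upper triangle
    return L[iu]

def omp(D, y, k):
    """Greedy orthogonal matching pursuit (an assumed stand-in sparse solver):
    approximate y with at most k columns of dictionary D."""
    resid, support = y.copy(), []
    x = np.zeros(D.shape[1])
    for _ in range(k):
        j = int(np.argmax(np.abs(D.T @ resid)))   # best-correlated atom
        if j not in support:
            support.append(j)
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        resid = y - D[:, support] @ coef          # re-fit on current support
    x[support] = coef
    return x

def classify(train_vecs, labels, query_vec, k=5):
    """Label the query by which class's training atoms best reconstruct it,
    mirroring the sparse-representation classification rule."""
    D = np.stack(train_vecs, axis=1)
    D = D / np.linalg.norm(D, axis=0)             # unit-norm dictionary atoms
    x = omp(D, query_vec, k)
    best, best_err = None, np.inf
    for c in set(labels):
        mask = np.array([l == c for l in labels])
        xc = np.where(mask, x, 0.0)               # keep only class-c coefficients
        err = np.linalg.norm(query_vec - D @ xc)  # class-restricted residual
        if err < best_err:
            best, best_err = c, err
    return best
```

A toy run: build log-covariance vectors for two synthetic "actions" whose features have different covariance structure, then classify a fresh segment drawn from the first. Segment-level labels could then be aggregated over a video by majority vote, as is common with per-segment classifiers.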