Spatio-temporal video representation with locality-constrained linear coding

Authors:
Manal Al Ghamdi;Nouf Al Harbi;Yoshihiko Gotoh
Affiliations:
University of Sheffield, UK;University of Sheffield, UK;University of Sheffield, UK
Venue:
ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part III
Year:
2012

Citing 15
Cited 0

Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Recognizing Human Actions: A Local SVM Approach

ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 3 - Volume 03
On Space-Time Interest Points

International Journal of Computer Vision
Actions as Space-Time Shapes

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
A 3-dimensional sift descriptor and its application to action recognition

Proceedings of the 15th international conference on Multimedia
Space-Time Behavior-Based Correlation—OR—How to Tell If Two Underlying Motion Fields Are Similar Without Computing Them?

IEEE Transactions on Pattern Analysis and Machine Intelligence
An Efficient Dense and Scale-Invariant Spatio-Temporal Interest Point Detector

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part II
Spatio-Temporal Frames in a Bag-of-Visual-Features Approach for Human Actions Recognition

SIBGRAPI '09 Proceedings of the 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing
Vlfeat: an open and portable library of computer vision algorithms

Proceedings of the international conference on Multimedia
Making action recognition robust to occlusions and viewpoint changes

ECCV'10 Proceedings of the 11th European conference on computer vision conference on Computer vision: Part III
An evaluation of bags-of-words and spatio-temporal shapes for action recognition

WACV '11 Proceedings of the 2011 IEEE Workshop on Applications of Computer Vision (WACV)
Robust sparse coding for face recognition

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Action recognition in videos acquired by a moving camera using motion decomposition of Lagrangian particle trajectories

ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Action recognition using rank-1 approximation of Joint Self-Similarity Volume

ICCV '11 Proceedings of the 2011 International Conference on Computer Vision

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a spatio-temporal coding technique for a video sequence. The framework is based on a space-time extension of scale-invariant feature transform (SIFT) combined with locality-constrained linear coding (LLC). The coding scheme projects each spatio-temporal descriptor into a local coordinate representation produced by max pooling. The extension is evaluated using human action classification tasks. Experiments with the KTH, Weizmann, UCF sports and Hollywood datasets indicate that the approach is able to produce results comparable to the state-of-the-art.