Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
Recognizing Human Actions: A Local SVM Approach
ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 3 - Volume 03
International Journal of Computer Vision
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
A 3-dimensional sift descriptor and its application to action recognition
Proceedings of the 15th international conference on Multimedia
IEEE Transactions on Pattern Analysis and Machine Intelligence
An Efficient Dense and Scale-Invariant Spatio-Temporal Interest Point Detector
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part II
Spatio-Temporal Frames in a Bag-of-Visual-Features Approach for Human Actions Recognition
SIBGRAPI '09 Proceedings of the 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing
Vlfeat: an open and portable library of computer vision algorithms
Proceedings of the international conference on Multimedia
Making action recognition robust to occlusions and viewpoint changes
ECCV'10 Proceedings of the 11th European conference on computer vision conference on Computer vision: Part III
An evaluation of bags-of-words and spatio-temporal shapes for action recognition
WACV '11 Proceedings of the 2011 IEEE Workshop on Applications of Computer Vision (WACV)
Robust sparse coding for face recognition
CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Action recognition using rank-1 approximation of Joint Self-Similarity Volume
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Hi-index | 0.00 |
This paper presents a spatio-temporal coding technique for a video sequence. The framework is based on a space-time extension of scale-invariant feature transform (SIFT) combined with locality-constrained linear coding (LLC). The coding scheme projects each spatio-temporal descriptor into a local coordinate representation produced by max pooling. The extension is evaluated using human action classification tasks. Experiments with the KTH, Weizmann, UCF sports and Hollywood datasets indicate that the approach is able to produce results comparable to the state-of-the-art.