Feature detector and descriptor evaluation in human action recognition

Authors:
Ling Shao;Riccardo Mattivi
Affiliations:
The University of Sheffield, Sheffield, UK and Shenzhen Institute of Advanced Integration Technology, CAS/CUHK, China;University of Trento, Povo (TN) - Italy
Venue:
Proceedings of the ACM International Conference on Image and Video Retrieval
Year:
2010

Citing 18
Cited 14

Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns

IEEE Transactions on Pattern Analysis and Machine Intelligence
Recognizing Action at a Distance

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Recognizing Human Actions: A Local SVM Approach

ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 3 - Volume 03
Space-Time Behavior Based Correlation

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
A Bayesian Hierarchical Model for Learning Natural Scene Categories

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Object Recognition with Features Inspired by Visual Cortex

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
A Performance Evaluation of Local Descriptors

IEEE Transactions on Pattern Analysis and Machine Intelligence
On Space-Time Interest Points

International Journal of Computer Vision
A Comparison of Affine Region Detectors

International Journal of Computer Vision
Face Description with Local Binary Patterns: Application to Face Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Behavior recognition via sparse spatio-temporal features

ICCCN '05 Proceedings of the 14th International Conference on Computer Communications and Networks
Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions

IEEE Transactions on Pattern Analysis and Machine Intelligence
A 3-dimensional sift descriptor and its application to action recognition

Proceedings of the 15th international conference on Multimedia
Description of interest regions with local binary patterns

Pattern Recognition
Human Action Recognition Using LBP-TOP as Sparse Spatio-Temporal Feature Descriptor

CAIP '09 Proceedings of the 13th International Conference on Computer Analysis of Images and Patterns
Description of interest regions with center-symmetric local binary patterns

ICVGIP'06 Proceedings of the 5th Indian conference on Computer Vision, Graphics and Image Processing
Local descriptors for spatio-temporal recognition

SCVMA'04 Proceedings of the First international conference on Spatial Coherence for Visual Motion Analysis

Transform based spatio-temporal descriptors for human action recognition

Neurocomputing
Recent advances and trends in visual tracking: A review

Neurocomputing
Action retrieval with relevance feedback on YouTube videos

Proceedings of the Third International Conference on Internet Multimedia Computing and Service
Relevance feedback for real-world human action retrieval

Pattern Recognition Letters
Human action recognition using a fast learning fully complex-valued classifier

Neurocomputing
Sparse Modeling of Human Actions from Motion Imagery

International Journal of Computer Vision
Spatio-temporal SIFT and its application to human action classification

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part I
Human action recognition based on boosted feature selection and naive Bayes nearest-neighbor classification

Signal Processing
Content-based retrieval of human actions from realistic video databases

Information Sciences: an International Journal
STV-based video feature processing for action recognition

Signal Processing
Combining appearance and structural features for human action recognition

Neurocomputing
A local descriptor based on Laplacian pyramid coding for action recognition

Pattern Recognition Letters
Active learning for human action retrieval using query pool selection

Neurocomputing
Sparse Discriminative Information Preservation for Chinese character font categorization

Neurocomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we evaluate and compare different feature detection and feature description methods for part-based approaches in human action recognition. Different methods have been proposed in the literature for both feature detection of space-time interest points and description of local video patches. It is however unclear which method performs better in the field of human action recognition. We compare, in the feature detection section, Dollar's method [18], Laptev's method [22], a bank of 3D-Gabor filters [6] and a method based on Space-Time Differences of Gaussians. We also compare and evaluate different descriptors such as Gradient [18], HOG-HOF [22], 3D SIFT [24] and an enhanced version of LBP-TOP [15]. We show the combination of Dollar's detection method and the improved LBP-TOP descriptor to be computationally efficient and to reach the best recognition accuracy on the KTH database.