Large scale continuous visual event recognition using max-margin Hough transformation framework

Authors:
Bhaskar Chakraborty;Jordi Gonzílez;F. Xavier Roca
Affiliations:
-;-;-
Venue:
Computer Vision and Image Understanding
Year:
2013

Citing 30
Cited 0

Efficient Component Labeling of Images of Arbitrary Dimension Represented by Linear Bintrees

IEEE Transactions on Pattern Analysis and Machine Intelligence
Support-Vector Networks

Machine Learning
Use of the Hough transformation to detect lines and curves in pictures

Communications of the ACM
A survey of computer vision-based human motion capture

Computer Vision and Image Understanding - Modeling people toward vision-based underatanding of a person's shape, appearance, and movement
Learning variable-length Markov models of behavior

Computer Vision and Image Understanding - Modeling people toward vision-based underatanding of a person's shape, appearance, and movement
Mean Shift: A Robust Approach Toward Feature Space Analysis

IEEE Transactions on Pattern Analysis and Machine Intelligence
The Representation and Recognition of Human Movement Using Temporal Templates

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Recognizing Action at a Distance

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Video Google: A Text Retrieval Approach to Object Matching in Videos

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Recognizing Human Actions: A Local SVM Approach

ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 3 - Volume 03
Space-Time Behavior Based Correlation

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Actions Sketch: A Novel Action Representation

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Learning and Detecting Activities from Movement Trajectories Using the Hierarchical Hidden Markov Models

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
On Space-Time Interest Points

International Journal of Computer Vision
Behavior recognition via sparse spatio-temporal features

ICCCN '05 Proceedings of the 14th International Conference on Computer Communications and Networks
A 3-dimensional sift descriptor and its application to action recognition

Proceedings of the 15th international conference on Multimedia
Actions as Space-Time Shapes

IEEE Transactions on Pattern Analysis and Machine Intelligence
Robust Object Detection with Interleaved Categorization and Segmentation

International Journal of Computer Vision
Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words

International Journal of Computer Vision
Learning an Alphabet of Shape and Appearance for Multi-Class Object Detection

International Journal of Computer Vision
An Efficient Dense and Scale-Invariant Spatio-Temporal Interest Point Detector

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part II
Motion Segmentation: a Review

Proceedings of the 2008 conference on Artificial Intelligence Research and Development: Proceedings of the 11th International Conference of the Catalan Association for Artificial Intelligence
A survey on vision-based human action recognition

Image and Vision Computing
Human activity analysis: A review

ACM Computing Surveys (CSUR)
Action recognition: A region based approach

WACV '11 Proceedings of the 2011 IEEE Workshop on Applications of Computer Vision (WACV)
Hough Forests for Object Detection, Tracking, and Action Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Unsupervised random forest indexing for fast action search

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
A large-scale benchmark dataset for event recognition in surveillance video

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Machine Recognition of Human Activities: A Survey

IEEE Transactions on Circuits and Systems for Video Technology
A selective spatio-temporal interest point detector for human action recognition in complex scenes

ICCV '11 Proceedings of the 2011 International Conference on Computer Vision

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we propose a novel method for continuous visual event recognition (CVER) on a large scale video dataset using max-margin Hough transformation framework. Due to high scalability, diverse real environmental state and wide scene variability direct application of action recognition/detection methods such as spatio-temporal interest point (STIP)-local feature based technique, on the whole dataset is practically infeasible. To address this problem, we apply a motion region extraction technique which is based on motion segmentation and region clustering to identify possible candidate ''event of interest'' as a preprocessing step. On these candidate regions a STIP detector is applied and local motion features are computed. For activity representation we use generalized Hough transform framework where each feature point casts a weighted vote for possible activity class centre. A max-margin frame work is applied to learn the feature codebook weight. For activity detection, peaks in the Hough voting space are taken into account and initial event hypothesis is generated using the spatio-temporal information of the participating STIPs. For event recognition a verification Support Vector Machine is used. An extensive evaluation on benchmark large scale video surveillance dataset (VIRAT) and as well on a small scale benchmark dataset (MSR) shows that the proposed method is applicable on a wide range of continuous visual event recognition applications having extremely challenging conditions.