Combining per-frame and per-track cues for multi-person action recognition

Authors:
Sameh Khamis;Vlad I. Morariu;Larry S. Davis
Affiliations:
University of Maryland, College Park;University of Maryland, College Park;University of Maryland, College Park
Venue:
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part I
Year:
2012

Citing 15
Cited 0

Recognizing Human Actions: A Local SVM Approach

ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 3 - Volume 03
Histograms of Oriented Gradients for Human Detection

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Actions as Space-Time Shapes

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Beyond Tracking: Modelling Activity and Understanding Behaviour

International Journal of Computer Vision
Learning, detection and representation of multi-agent events in videos

Artificial Intelligence
Fast solvers and efficient implementations for distance metric learning

Proceedings of the 25th international conference on Machine learning
LIBLINEAR: A Library for Large Linear Classification

The Journal of Machine Learning Research
Belief propagation for min-cost network flow: convergence & correctness

SODA '10 Proceedings of the twenty-first annual ACM-SIAM symposium on Discrete Algorithms
Stochastic Representation and Recognition of High-Level Group Activities

International Journal of Computer Vision
Multiple Object Tracking Using K-Shortest Paths Optimization

IEEE Transactions on Pattern Analysis and Machine Intelligence
TextonBoost: joint appearance, shape and context modeling for multi-class object recognition and segmentation

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I
Globally-optimal greedy algorithms for tracking a variable number of objects

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Multi-agent event recognition in structured scenarios

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Multiobject tracking as maximum weight independent set

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Probabilistic event logic for interval-based event recognition

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a model to combine per-frame and per-track cues for action recognition. With multiple targets in a scene, our model simultaneously captures the natural harmony of an individual's action in a scene and the flow of actions of an individual in a video sequence, inferring valid tracks in the process. Our motivation is based on the unlikely discordance of an action in a structured scene, both at the track level and the frame level (e.g., a person dancing in a crowd of joggers). While we can utilize sampling approaches for inference in our model, we instead devise a global inference algorithm by decomposing the problem and solving the subproblems exactly and efficiently, recovering a globally optimal joint solution in several cases. Finally, we improve on the state-of-the-art action recognition results for two publicly available datasets.