We propose a model that combines per-frame and per-track cues for action recognition. With multiple targets in a scene, our model simultaneously captures the natural harmony of an individual's action within the scene and the flow of that individual's actions over a video sequence, inferring valid tracks in the process. Our motivation is that a discordant action is unlikely in a structured scene, at both the track level and the frame level (e.g., a person dancing in a crowd of joggers). Although sampling approaches could be used for inference in our model, we instead devise a global inference algorithm that decomposes the problem and solves the subproblems exactly and efficiently, recovering a globally optimal joint solution in several cases. Finally, we improve on state-of-the-art action recognition results on two publicly available datasets.