Modeling the temporal extent of actions

Authors:
Scott Satkin;Martial Hebert
Affiliations:
Carnegie Mellon University, The Robotics Institute;Carnegie Mellon University, The Robotics Institute
Venue:
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Year:
2010

Citing 10
Cited 7

Original Contribution: Stacked generalization

Neural Networks
Solving the multiple instance problem with axis-parallel rectangles

Artificial Intelligence
Segmenting film sequences using active surfaces

ICIP '97 Proceedings of the 1997 International Conference on Image Processing (ICIP '97) 3-Volume Set-Volume 1 - Volume 1
Probability Estimates for Multi-class Classification by Pairwise Coupling

The Journal of Machine Learning Research
Recognizing Human Actions: A Local SVM Approach

ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 3 - Volume 03
Space-Time Behavior Based Correlation

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Interactive video cutout

ACM SIGGRAPH 2005 Papers
Actions as Space-Time Shapes

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Object Recognition by Integrating Multiple Image Segmentations

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part III
Movie/Script: Alignment and Parsing of Video and Text Transcription

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part IV

Are current monocular computer vision systems for human action recognition suitable for visual surveillance applications?

ISVC'11 Proceedings of the 7th international conference on Advances in visual computing - Volume Part II
Multiple instance learning for classification of human behavior observations

ACII'11 Proceedings of the 4th international conference on Affective computing and intelligent interaction - Volume Part I
Kernelized temporal cut for online temporal segmentation and recognition

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Learning latent spatio-temporal compositional model for human action recognition

Proceedings of the 21st ACM international conference on Multimedia
Human action recognition with salient trajectories

Signal Processing
Object and Action Classification with Latent Window Parameters

International Journal of Computer Vision
Max-Margin Early Event Detectors

International Journal of Computer Vision

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present a framework for estimating what portions of videos are most discriminative for the task of action recognition. We explore the impact of the temporal cropping of training videos on the overall accuracy of an action recognition system, and we formalize what makes a set of croppings optimal. In addition, we present an algorithm to determine the best set of croppings for a dataset, and experimentally show that our approach increases the accuracy of various state-of-the-art action recognition techniques.