Learning, detection and representation of multi-agent events in videos

  • Authors:
  • Asaad Hakeem; Mubarak Shah

  • Affiliation:
  • Computer Vision Lab, School of Electrical Engineering and Computer Science, University of Central Florida, Orlando, FL 32816, USA

  • Venue:
  • Artificial Intelligence
  • Year:
  • 2007

Abstract

In this paper, we model multi-agent events as a temporally varying sequence of sub-events, and propose a novel approach for learning, detecting, and representing events in videos. The proposed approach has three main steps. First, to learn the event structure from training videos, we automatically encode the sub-event dependency graph, which is the learnt event model depicting the conditional dependencies between sub-events. Second, we pose event detection in novel videos as clustering the maximally correlated sub-events using normalized cuts. The principal assumption made in this work is that events are composed of highly correlated chains of sub-events that have high weights (association) within a cluster and relatively low weights (disassociation) between clusters. The event detection requires no prior knowledge of the number of agents involved in an event and makes no assumptions about the length of an event. Third, we recognize that any abstract event model should extend to representations related to human understanding of events. Therefore, we propose an extension of the CASE representation of natural languages that provides a plausible means of interfacing between users and the computer. We show results of learning, detection, and representation of events for videos in the meeting, surveillance, and railroad monitoring domains.
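The second step, detecting events by grouping maximally correlated sub-events with normalized cuts, can be illustrated with a minimal sketch. The sketch below is not the authors' implementation; it assumes a hypothetical pairwise affinity matrix `W` over sub-events (high weights within an event, low weights across events) and applies the standard Shi-Malik normalized-cut relaxation: threshold the second eigenvector of the normalized graph Laplacian.

```python
import numpy as np

def normalized_cut_bipartition(W):
    """Split items into two clusters via the normalized-cut relaxation:
    threshold the second-smallest eigenvector of the symmetric normalized
    Laplacian L_sym = I - D^{-1/2} W D^{-1/2} (Shi & Malik)."""
    d = W.sum(axis=1)                      # node degrees
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))
    L_sym = np.eye(len(W)) - D_inv_sqrt @ W @ D_inv_sqrt
    eigvals, eigvecs = np.linalg.eigh(L_sym)   # ascending eigenvalues
    # Map back to the generalized eigenvector (the "Fiedler vector")
    fiedler = D_inv_sqrt @ eigvecs[:, 1]
    return fiedler > 0                     # boolean cluster labels

# Toy affinity over five sub-events: 0-2 form one tightly associated
# event, 3-4 another, with weak cross-event links (values are invented
# for illustration only).
W = np.array([
    [0.0, 0.9, 0.8, 0.1, 0.1],
    [0.9, 0.0, 0.9, 0.1, 0.1],
    [0.8, 0.9, 0.0, 0.1, 0.1],
    [0.1, 0.1, 0.1, 0.0, 0.9],
    [0.1, 0.1, 0.1, 0.9, 0.0],
])
labels = normalized_cut_bipartition(W)
# Sub-events 0-2 land in one cluster and 3-4 in the other.
```

In practice the affinities would be derived from the learnt sub-event dependencies, and the bipartition would be applied recursively since the number of events (and agents) is not known in advance.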