An event detection framework in video sequences based on hierarchic event structure perception

Authors:
Lang Congyan;Xu De
Affiliations:
Institute of Computer Science, Beijing Jiaotong University, Beijing, China;Institute of Computer Science, Beijing Jiaotong University, Beijing, China
Venue:
ISPRA'06 Proceedings of the 5th WSEAS International Conference on Signal Processing, Robotics and Automation
Year:
2006

Citing 9
Cited 0

Parameterized modeling and recognition of activities

Computer Vision and Image Understanding
A Bayesian Computer Vision System for Modeling Human Interactions

IEEE Transactions on Pattern Analysis and Machine Intelligence
Recognizing planned multiperson action

Computer Vision and Image Understanding - Modeling people toward vision-based underatanding of a person's shape, appearance, and movement
Event Detection and Analysis from Video Streams

IEEE Transactions on Pattern Analysis and Machine Intelligence
Mean Shift: A Robust Approach Toward Feature Space Analysis

IEEE Transactions on Pattern Analysis and Machine Intelligence
Video-based event recognition: activity representation and probabilistic recognition methods

Computer Vision and Image Understanding - Special issue on event detection in video
HMM based structuring of tennis videos using visual and audio cues

ICME '03 Proceedings of the 2003 International Conference on Multimedia and Expo - Volume 3 (ICME '03) - Volume 03
Multimedia event-based video indexing using time intervals

IEEE Transactions on Multimedia
Automatic soccer video analysis and summarization

IEEE Transactions on Image Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we propose a framework for event detection based on hierarchic event structure perception. In order to modeling and recognizing semantic event, it is necessary to organize the spatial and temporal visual information into a meaningful representation. The main purpose of this paper is not to detect event in special domain, but to construct general event detection framework in perceptual manner and to provide meaningful unit in different semantic granularity. Specially, in the first stage fine-grained segmentation is preformed by bottom up processing that characteristics of salient regions serve as direct cues to identify temporal boundaries. Furthermore, top-down recognition module detects coarse-grain event by using HMMs to combine prior knowledge with spatio-temporal descriptors of fine-grain unit. The experimental results using different types of video sequences are presented to demonstrate the efficiency and accuracy of our proposed algorithm.