Determining the best suited semantic events for cognitive surveillance

Authors:
C. Fernández;P. Baiget;F. X. Roca;J. Gonzàlez
Affiliations:
Computer Vision Center, UAB, Edifici O, Campus UAB, 08193 Barcelona, Spain;Computer Vision Center, UAB, Edifici O, Campus UAB, 08193 Barcelona, Spain;Computer Vision Center, UAB, Edifici O, Campus UAB, 08193 Barcelona, Spain;Computer Vision Center, UAB, Edifici O, Campus UAB, 08193 Barcelona, Spain
Venue:
Expert Systems with Applications: An International Journal
Year:
2011

Citing 19
Cited 3

Formal ontology, conceptual analysis and knowledge representation

International Journal of Human-Computer Studies - Special issue: the role of formal ontology in the information technology
Content-Based Image Retrieval at the End of the Early Years

IEEE Transactions on Pattern Analysis and Machine Intelligence
A survey on tree edit distance and related problems

Theoretical Computer Science
Beyond Tracking: Modelling Activity and Understanding Behaviour

International Journal of Computer Vision
Surveillance Event Interpretation Using Generalized Stochastic Petri Nets

WIAMIS '07 Proceedings of the Eighth International Workshop on Image Analysis for Multimedia Interactive Services
Video understanding for complex activity recognition

Machine Vision and Applications
Representation of occurrences for road vehicle traffic

Artificial Intelligence
Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words

International Journal of Computer Vision
Road-traffic monitoring by knowledge-driven static and dynamic image analysis

Expert Systems with Applications: An International Journal
Interpretation of complex situations in a semantic-based surveillance framework

Image Communication
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Multi-thread Parsing for Recognizing Complex Events in Videos

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part III
Automatic scene detection for advanced story retrieval

Expert Systems with Applications: An International Journal
A cognitive surveillance system for detecting incorrect traffic behaviors

Expert Systems with Applications: An International Journal
A Context Model and Reasoning System to improve object tracking in complex scenarios

Expert Systems with Applications: An International Journal
Understanding dynamic scenes based on human sequence evaluation

Image and Vision Computing
Expert system for color image retrieval

Expert Systems with Applications: An International Journal
Automatic detection and indexing of video-event shots for surveillance applications

IEEE Transactions on Multimedia
A Constrained Probabilistic Petri Net Framework for Human Activity Detection in Video

IEEE Transactions on Multimedia

Human activity monitoring by local and global finite state machines

Expert Systems with Applications: An International Journal
The retrieval of motion event by associations of temporal frequent pattern growth

Future Generation Computer Systems
W3-privacy: understanding what, when, and where inference channels in multi-camera surveillance video

Multimedia Tools and Applications

Quantified Score

Hi-index	12.05

Abstract

State-of-the-art systems for cognitive surveillance identify and describe complex events in selected domains, thus providing end-users with tools to easily access the contents of massive video footage. Nevertheless, as events become semantically more complex and the types of indoor/outdoor scenarios diversify, it becomes difficult to assess which events best describe the scene, and how to model them at the pixel level to fulfill natural language requests. We present an ontology-based methodology that guides the identification, step-by-step modeling, and generalization of the events most relevant to a specific domain. Our approach consists of three steps: (1) end-users provide textual evidence from surveilled video sequences; (2) the transcriptions are analyzed top-down to build the knowledge bases for event description; and (3) the obtained models are used to generalize event detection to different image sequences from the surveillance domain. This framework produces user-oriented knowledge that improves on existing advanced interfaces for video indexing and retrieval by determining the events best suited for video understanding according to end-users. We have conducted experiments with outdoor and indoor scenes showing thefts, chases, and vandalism, demonstrating the feasibility and generalization of this proposal.
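
As a rough illustration of steps (1) and (2), the sketch below ranks candidate events by how often end-user descriptions mention them. It is not the authors' method: the abstract does not specify how transcriptions are analyzed, so the trigger-phrase lexicon `EVENT_LEXICON`, the helper names `events_in` and `rank_events`, and the frequency-based ranking are all assumptions made for this example.

```python
from collections import Counter

# Hypothetical lexicon mapping trigger phrases to event concepts (assumption,
# not taken from the paper). A real system would use an ontology instead.
EVENT_LEXICON = {
    "steals": "theft",
    "takes": "theft",
    "chases": "chase",
    "runs after": "chase",
    "kicks": "vandalism",
    "breaks": "vandalism",
}


def events_in(description: str) -> set[str]:
    """Return the event concepts whose trigger phrases occur in one description."""
    text = description.lower()
    return {concept for phrase, concept in EVENT_LEXICON.items() if phrase in text}


def rank_events(descriptions: list[str]) -> list[tuple[str, float]]:
    """Rank event concepts by the fraction of descriptions that mention them."""
    counts = Counter()
    for description in descriptions:
        counts.update(events_in(description))
    total = len(descriptions)
    scored = ((concept, hits / total) for concept, hits in counts.items())
    # Highest agreement first; ties broken alphabetically for a stable order.
    return sorted(scored, key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    # Made-up end-user descriptions of surveillance clips (illustrative only).
    sample = [
        "A man steals a bag and runs away",
        "Someone takes the bag; another person chases him",
        "A person kicks the vending machine",
    ]
    for concept, share in rank_events(sample):
        print(f"{concept}: mentioned in {share:.0%} of descriptions")
```

Ranking by the share of descriptions that mention an event is only one simple proxy for "best suited according to end-users"; substring matching on a fixed lexicon is likewise a stand-in for the top-down, ontology-guided analysis the abstract describes.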