Steps toward a cognitive vision system

Authors:
Hans-Hellmut Nagel
Affiliations:
-
Venue:
AI Magazine
Year:
2004

Abstract

An adequate natural language description of developments in a real-world scene can be taken as proof of "understanding what is going on." An algorithmic system that generates natural language descriptions from video recordings of road traffic scenes can be said to "understand" its input to the extent that the algorithmically generated text is acceptable to the humans judging it. A fuzzy metric-temporal Horn logic (FMTHL) provides a formalism for representing both schematic and instantiated conceptual knowledge about the depicted scene and its temporal development. The resulting conceptual representation mediates in a systematic manner between the spatiotemporal geometric descriptions extracted from video input and a module that generates natural language text. This article outlines a 30-year effort to create such a cognitive vision system, indicates its current status, summarizes lessons learned along the way, and discusses open problems against this background.