An adequate natural language description of developments in a real-world scene can be taken as proof of "understanding what is going on." An algorithmic system that generates natural language descriptions from video recordings of road traffic scenes can be said to "understand" its input to the extent that the algorithmically generated text is acceptable to the humans judging it. A fuzzy metric-temporal Horn logic (FMTHL) provides a formalism for representing both schematic and instantiated conceptual knowledge about the depicted scene and its temporal development. The resulting conceptual representation mediates systematically between the spatiotemporal geometric descriptions extracted from video input and a module that generates natural language text. This article outlines a 30-year effort to create such a cognitive vision system, indicates its current status, summarizes lessons learned along the way, and discusses open problems against this background.