Ontology-driven semantic video analysis using visual information objects

  • Authors:
  • Georgios Th. Papadopoulos;Vasileios Mezaris;Ioannis Kompatsiaris;Michael G. Strintzis

  • Affiliations:
  • Information Processing Laboratory, Electrical and Computer Engineering Department, Aristotle University of Thessaloniki, Greece and Informatics and Telematics Institute, CERTH, Thessaloniki, Greec ...;Informatics and Telematics Institute, CERTH, Thessaloniki, Greece;Informatics and Telematics Institute, CERTH, Thessaloniki, Greece;Information Processing Laboratory, Electrical and Computer Engineering Department, Aristotle University of Thessaloniki, Greece and Informatics and Telematics Institute, CERTH, Thessaloniki, Greec ...

  • Venue:
  • SAMT'07 Proceedings of the semantic and digital media technologies 2nd international conference on Semantic Multimedia
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, an ontology-driven approach for the semantic analysis of video is proposed. This approach builds on an ontology infrastructure and in particular a multimedia ontology that is based on the notions of Visual Information Object (VIO) and Multimedia Information Object (MMIO). The latter constitute extensions of the Information Object (IO) design pattern, previously proposed for refining and extending the DOLCE core ontology. This multimedia ontology, along with the more domain-specific parts of the developed knowledge infrastructure, supports the analysis of video material, models the content layer of video, and defines generic as well as domain-specific concepts whose detection is important for the analysis and description of video of the specified domain. The signal-level video processing that is necessary for linking the developed ontology infrastructure with the signal domain includes the combined use of a temporal and a spatial segmentation algorithm, a layered structure of Support Vector Machines (SVMs)-based classifiers and a classifier fusion mechanism. A Genetic Algorithm (GA) is introduced for optimizing the performed information fusion step. These processing methods support the decomposition of visual information, as specified by the multimedia ontology, and the detection of the defined domain-specific concepts that each piece of video signal, treated as a VIO, is related to. Experimental results in the domain of disaster news video demonstrate the efficiency of the proposed approach.