Real-time compressed-domain spatiotemporal segmentation and ontologies for video indexing and retrieval

Authors:
V. Mezaris;I. Kompatsiaris;N. V. Boulgouris;M. G. Strintzis
Affiliations:
Electr. & Comput. Eng. Dept., Aristotle Univ. of Thessaloniki, Greece;-;-;-
Venue:
IEEE Transactions on Circuits and Systems for Video Technology
Year:
2004

Cited by (30):

Automatic video annotation using ontologies extended with visual information
Proceedings of the 13th annual ACM international conference on Multimedia

Enhanced ontologies for video annotation and retrieval
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval

Real-time spatiotemporal segmentation of video objects in the H.264 compressed domain
Journal of Visual Communication and Image Representation

Face tracking in the compressed domain
EURASIP Journal on Applied Signal Processing

Lightweight object tracking in compressed video streams demonstrated in region-of-interest coding
EURASIP Journal on Applied Signal Processing

Dynamic pictorial ontologies for video digital libraries annotation
Workshop on multimedia information retrieval on The many faces of multimedia semantics

Multiple moving object detection for fast video content description in compressed domain
EURASIP Journal on Advances in Signal Processing

Multimedia enriched ontologies for video digital libraries
International Journal of Parallel, Emergent and Distributed Systems

Video Semantic Content Analysis Framework Based on Ontology Combined MPEG-7
Adaptive Multimedia Retrieval: Retrieval, User, and Semantics

Video Object Segmentation Based on Feedback Schemes Guided by a Low-Level Scene Ontology
ACIVS '08 Proceedings of the 10th International Conference on Advanced Concepts for Intelligent Vision Systems

Unsupervised Video Shot Segmentation Using Global Color and Texture Information
ISVC '08 Proceedings of the 4th International Symposium on Advances in Visual Computing

An Approach to Trajectory Estimation of Moving Objects in the H.264 Compressed Domain
PSIVT '09 Proceedings of the 3rd Pacific Rim Symposium on Advances in Image and Video Technology

Automatic objects behaviour recognition from compressed video domain
Image and Vision Computing

Compressed domain indexing of scalable H.264/SVC streams
Image Communication

Real-time moving object segmentation in H.264 compressed domain based on approximate reasoning
International Journal of Approximate Reasoning

Taxonomy of directing semantics for film shot classification
IEEE Transactions on Circuits and Systems for Video Technology

An efficient video indexing and retrieval algorithm using the luminance field trajectory modeling
IEEE Transactions on Circuits and Systems for Video Technology

Multimedia ontology based computational framework for video annotation and retrieval
MCAM'07 Proceedings of the 2007 international conference on Multimedia content analysis and mining

A video retrieval algorithm using random projections
ICIP'09 Proceedings of the 16th IEEE international conference on Image processing

Instant customized summaries streaming: a service for immediate awareness of new video content
AMR'09 Proceedings of the 7th international conference on Adaptive multimedia retrieval: understanding media and adapting to the user

An engine for content-aware on-line video adaptation
SAMT'06 Proceedings of the First international conference on Semantic and Digital Media Technologies

Shot boundary detection algorithm in compressed domain based on adaboost and fuzzy theory
ICNC'06 Proceedings of the Second international conference on Advances in Natural Computation - Volume Part II

Domain knowledge extension with pictorially enriched ontologies
CAIP'05 Proceedings of the 11th international conference on Computer Analysis of Images and Patterns

Object tracking using background subtraction and motion estimation in MPEG videos
ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part II

Video scene analysis in 3D wavelet transform domain
Multimedia Tools and Applications

Using knowledge representation languages for video annotation and retrieval
FQAS'06 Proceedings of the 7th international conference on Flexible Query Answering Systems

An ontology infrastructure for multimedia reasoning
VLBV'05 Proceedings of the 9th international conference on Visual Content Processing and Representation

An efficient approach to content-based object retrieval in videos
Neurocomputing

Efficient partial decoding scheme for intra frame in H.264/AVC stream
PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing

Retrieval of high-dimensional visual data: current state, trends and challenges ahead
Multimedia Tools and Applications

Abstract

This paper presents a novel algorithm for the real-time, compressed-domain, unsupervised segmentation of image sequences and applies it to video indexing and retrieval. The segmentation algorithm uses motion and color information extracted directly from the MPEG-2 compressed stream. An iterative rejection scheme based on the bilinear motion model is used to separate foreground from background. Meaningful foreground spatiotemporal objects are then formed in three steps: examining the temporal consistency of the iterative rejection output, clustering the resulting foreground macroblocks into connected regions, and tracking those regions. The background is additionally segmented into spatiotemporal objects. MPEG-7 compliant low-level descriptors of the color, shape, position, and motion of the resulting spatiotemporal objects are extracted and automatically mapped to appropriate intermediate-level descriptors, which form a simple vocabulary termed the object ontology. Combined with a relevance feedback mechanism, this ontology allows the qualitative definition of the high-level concepts the user queries for (semantic objects, each represented by a keyword) and the retrieval of relevant video segments. Desired spatial and temporal relationships between the objects in multiple-keyword queries can also be expressed, using the shot ontology. Experimental results on known sequences demonstrate the efficiency of the proposed segmentation approach. Sample queries reveal the potential of employing this segmentation algorithm as part of an object-based video indexing and retrieval scheme.
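
The iterative rejection step mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the rejection rule, so the sketch assumes that a macroblock is rejected when its residual exceeds the mean residual of the current inliers (with a floor, `min_error`, to stop rejecting once the fit is good). All function names are hypothetical, and the motion vectors are assumed to have been parsed from the compressed stream already. The bilinear model used here is the common 8-parameter form vx = a1 + a2*x + a3*y + a4*x*y, vy = a5 + a6*x + a7*y + a8*x*y.

```python
from math import hypot
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def solve_linear(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        if abs(m[col][col]) < 1e-12:
            raise ValueError("degenerate block layout: system is singular")
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x


def basis(x: float, y: float) -> List[float]:
    """Bilinear basis functions evaluated at a macroblock position."""
    return [1.0, x, y, x * y]


def fit_bilinear(pos: Sequence[Point], vecs: Sequence[Point],
                 idx: Sequence[int]) -> List[List[float]]:
    """Least-squares fit of the bilinear model on the blocks listed in idx.

    Returns [[a1, a2, a3, a4], [a5, a6, a7, a8]] via the normal equations.
    """
    params = []
    for comp in (0, 1):  # 0: horizontal component, 1: vertical component
        ata = [[0.0] * 4 for _ in range(4)]
        atb = [0.0] * 4
        for i in idx:
            phi = basis(*pos[i])
            for r in range(4):
                atb[r] += phi[r] * vecs[i][comp]
                for c in range(4):
                    ata[r][c] += phi[r] * phi[c]
        params.append(solve_linear(ata, atb))
    return params


def predict(params: List[List[float]], x: float, y: float) -> Point:
    """Motion vector predicted by the bilinear model at (x, y)."""
    phi = basis(x, y)
    return (sum(p * f for p, f in zip(params[0], phi)),
            sum(p * f for p, f in zip(params[1], phi)))


def iterative_rejection(pos: Sequence[Point], vecs: Sequence[Point],
                        min_error: float = 0.5, max_iter: int = 20):
    """Return (foreground_mask, params) for one frame of macroblock vectors.

    Assumed rule: refit on the inliers, then reject every inlier whose
    residual exceeds max(mean inlier residual, min_error).
    """
    inliers = list(range(len(pos)))
    params = fit_bilinear(pos, vecs, inliers)
    for _ in range(max_iter):
        params = fit_bilinear(pos, vecs, inliers)
        err = {}
        for i in inliers:
            px, py = predict(params, *pos[i])
            err[i] = hypot(vecs[i][0] - px, vecs[i][1] - py)
        threshold = max(sum(err.values()) / len(inliers), min_error)
        kept = [i for i in inliers if err[i] <= threshold]
        if kept == inliers or len(kept) < 4:  # converged, or too few to refit
            break
        inliers = kept
    background = set(inliers)
    return [i not in background for i in range(len(pos))], params
```

Blocks flagged `True` in the returned mask are the ones whose motion the global (background) model could not explain. In the pipeline the abstract describes, such blocks would next be checked for temporal consistency and clustered into connected regions; those stages are not covered by this sketch.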