Non-sequential multiscale content-based video decomposition

Authors:
Nikolaos Doulamis;Anastasios Doulamis
Affiliations:
Electrical and Computer Engineering Department, Computer Science Division, National Technical University of Athens, 11.23 office, 9, Heroon Polytechniou Street, Zografou 15773, Athens, Greece;Electrical and Computer Engineering Department, Computer Science Division, National Technical University of Athens, 11.23 office, 9, Heroon Polytechniou Street, Zografou 15773, Athens, Greece
Venue:
Signal Processing - Special section on content-based image and video retrieval
Year:
2005

Abstract

This paper presents a multiscale content-based video decomposition scheme for efficient non-linear (non-sequential) organization of visual video content. Each video file is analyzed into a multiscale structure of different "content resolution levels", forming a hierarchy from the lowest (coarse) to the highest (fine) resolution. The scheme resembles the progressive transmission of still images: instead of transmitting the image sequentially at full resolution by scanning it line by line, a low-resolution version is delivered first and the image quality is then gradually enhanced, so that the user can see a preview of the image content at any time. The proposed video decomposition is represented as a graph structure in which each level corresponds to a particular content resolution and the graph nodes correspond to the respective regions into which the content is analyzed at that level. Transitions among nodes of the same level are also permitted. The number of nodes at a given level expresses the degree of detail with which the content is analyzed at that level. This number is estimated by minimizing the average transmitted information required to localize a video segment of interest, and it also takes the content complexity into account. Quality criteria are introduced to evaluate the efficiency of the proposed scheme. The efficiency of the organization is maximized when the multiscale content decomposition is performed using content representatives and by constructing content classes. In our approach, content representatives are estimated as those of maximum dissimilarity, as expressed by a distance metric. The optimization is carried out by a stochastic algorithm with a logarithmically reduced search area (stochastic logarithmic).
Experimental results on real-life video sequences show that the proposed multiscale video organization lets users detect content of interest much faster than conventional sequential video scanning or other video decomposition/summarization methods, resulting in better organization efficiency as measured by the quality criteria.
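
The idea of choosing maximally dissimilar content representatives and grouping the remaining frames into content classes can be illustrated with a minimal sketch. It is not the authors' method: the abstract names a stochastic logarithmic search, whereas the sketch below swaps in a simple greedy farthest-point heuristic. The function names (`distance`, `pick_representatives`, `assign_classes`), the Euclidean metric, and the toy two-dimensional feature vectors are all assumptions made for illustration.

```python
import math


def distance(a, b):
    """Euclidean distance between two equal-length feature vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def pick_representatives(features, k):
    """Greedy farthest-point selection of up to k mutually dissimilar frames.

    Returns frame indices in the order they were selected.
    This is a stand-in heuristic, not the stochastic logarithmic search.
    """
    chosen = [0]  # arbitrary start: the first frame
    # min_dist[i] = distance from frame i to its closest chosen representative
    min_dist = [distance(f, features[0]) for f in features]
    while len(chosen) < min(k, len(features)):
        nxt = max(range(len(features)), key=lambda i: min_dist[i])
        if min_dist[nxt] == 0:
            break  # every remaining frame duplicates a chosen representative
        chosen.append(nxt)
        min_dist = [min(d, distance(f, features[nxt]))
                    for d, f in zip(min_dist, features)]
    return chosen


def assign_classes(features, reps):
    """Group every frame index under its nearest representative (a content class)."""
    classes = {r: [] for r in reps}
    for i, f in enumerate(features):
        nearest = min(reps, key=lambda r: distance(f, features[r]))
        classes[nearest].append(i)
    return classes


# Toy per-frame feature vectors (hypothetical data): three visually distinct groups.
features = [[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0], [10.0, 0.0]]

coarse = pick_representatives(features, 2)  # coarse content resolution level
fine = pick_representatives(features, 3)    # finer level adds one representative
fine_classes = assign_classes(features, fine)
```

In this greedy sketch, the representatives of a coarse level are always a prefix of those of a finer level, which gives a simple coarse-to-fine nesting. That nesting is a property of the heuristic used here; the abstract does not state how the paper's own levels relate to one another.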