Computational approaches to temporal sampling of video sequences

Authors:
Tiecheng Liu;John R. Kender
Affiliations:
University of South Carolina, Columbia, SC;Columbia University, New York, NY
Venue:
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Year:
2007

Citing 28
Cited 2

A video retrieval and sequencing system

ACM Transactions on Information Systems (TOIS) - Special issue on video information retrieval
Video parsing, retrieval and browsing: an integrated and content-based solution

Proceedings of the third ACM international conference on Multimedia
VideoQ: an automated content based video search system using visual cues

MULTIMEDIA '97 Proceedings of the fifth ACM international conference on Multimedia
Video keyframe extraction and filtering: a keyframe is not a keyframe to everyone

CIKM '97 Proceedings of the sixth international conference on Information and knowledge management
Video summarization by curve simplification

MULTIMEDIA '98 Proceedings of the sixth ACM international conference on Multimedia
Video summarization using R-sequences

Real-Time Imaging
Dynamic selection and effective compression of key frames for video abstraction

Pattern Recognition Letters
Video Skimming and Characterization through the Combination of Image and Language Understanding

CAIVD '98 Proceedings of the 1998 International Workshop on Content-Based Access of Image and Video Databases (CAIVD '98)
Constrained Utility Maximization for Generating Visual Skims

CBAIVL '01 Proceedings of the IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL'01)
Video Skimming and Characterization through the Combination of Image and Language Understanding Techniques

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Efficient matching and clustering of video shots

ICIP '95 Proceedings of the 1995 International Conference on Image Processing (Vol. 1)-Volume 1 - Volume 1
Semantic Video Model for Content-Based Retrieval

ICMCS '99 Proceedings of the IEEE International Conference on Multimedia Computing and Systems - Volume 2
A Semantic Modeling Approach for Video Retrieval by Content

ICMCS '99 Proceedings of the IEEE International Conference on Multimedia Computing and Systems - Volume 2
Time-Constrained Clustering for Segmentation of Video into Story Unites

ICPR '96 Proceedings of the International Conference on Pattern Recognition (ICPR '96) Volume III-Volume 7276 - Volume 7276
A Metric for Distributions with Applications to Image Databases

ICCV '98 Proceedings of the Sixth International Conference on Computer Vision
Content-Based Video Summarization and Adaptation for Ubiquitous Media Access

ICIAP '03 Proceedings of the 12th International Conference on Image Analysis and Processing
Detecting image near-duplicate by stochastic attributed relational graph matching with learning

Proceedings of the 12th annual ACM international conference on Multimedia
Video summaries and cross-referencing through mosaic-based representation

Computer Vision and Image Understanding
Graph Edit Distance from Spectral Seriation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Salient stills

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Key frame selection by motion analysis

ICASSP '96 Proceedings of the Acoustics, Speech, and Signal Processing, 1996. on Conference Proceedings., 1996 IEEE International Conference - Volume 02
Time-Constrained Keyframe Selection Technique

ICMCS '99 Proceedings of the IEEE International Conference on Multimedia Computing and Systems - Volume 2
Two-stage hierarchical video summary extraction to match low-level user browsing preferences

IEEE Transactions on Multimedia
Concept-oriented indexing of video databases: toward semantic sensitive retrieval and browsing

IEEE Transactions on Image Processing
Efficient video indexing scheme for content-based retrieval

IEEE Transactions on Circuits and Systems for Video Technology
An integrated scheme for automated video abstraction based on unsupervised cluster-validity analysis

IEEE Transactions on Circuits and Systems for Video Technology
Optimal nonlinear sampling for video streaming at low bit rates

IEEE Transactions on Circuits and Systems for Video Technology
A novel video key-frame-extraction algorithm based on perceived motion energy model

IEEE Transactions on Circuits and Systems for Video Technology

Gesture salience as a hidden variable for coreference resolution and keyframe extraction

Journal of Artificial Intelligence Research
Feature extraction and clustering for dynamic video summarisation

Neurocomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Video key frame extraction is one of the most important research problems for video summarization, indexing, and retrieval. For a variety of applications such as ubiquitous media access and video streaming, the temporal boundaries between video key frames are required for synchronizing visual content with audio. In this article, we define temporal video sampling as a unified process of extracting video key frames and computing their temporal boundaries, and formulate it as an optimization problem. We first provide an optimal approach that minimizes temporal video sampling error using a dynamic programming process. The optimal approach retrieves a key frame hierarchy and all temporal boundaries in O(n4) time and O(n2) space. To further reduce computational complexity, we also provide a suboptimal greedy algorithm that exploits the data structure of a binary heap and uses a novel “look-ahead” computational technique, enabling all levels of key frames to be extracted with an average-case computational time of O(n log n) and memory usage of O(n). Both the optimal and the greedy methods are free of parameters, thus avoiding the threshold-selection problem that exists in other approaches. We empirically compare the proposed optimal and greedy methods with several existing methods in terms of video sampling error, computational cost, and subjective quality. An evaluation of eight videos of different genres shows that the greedy approach achieves performance very close to that of the optimal approach while drastically reducing computational cost, making it suitable for processing long video sequences in large video databases.