Temporal video structuring for preservation and annotation of video content

  • Authors:
  • Christian Petersohn

  • Affiliations:
  • Fraunhofer Institute for Telecommunications, Heinrich-Hertz-Institut, Berlin, Germany

  • Venue:
  • ICIP '09: Proceedings of the 16th IEEE International Conference on Image Processing
  • Year:
  • 2009

Abstract

As books have chapters, sections, paragraphs, sentences, etc., videos have an inherent hierarchical structure as well. Chapters, scenes, shots and sub-shots are the temporal units in videos. Because manual structure extraction is time-consuming, automatic segmentation has been an active research topic for the past 10 to 15 years and is a prerequisite for efficient video indexing, annotation, search and retrieval. This paper focuses on our recent research in the fields of scene, shot and sub-shot extraction and their combination into a video structure detection system. The first step is the detection of shot transitions, with separate detectors for hard cuts, fades, dissolves and wipes. Complex shots are then further segmented into semantically meaningful units called sub-shots. Finally, these results are used to extract scenes. We propose to use film grammar based on shot transition types to improve the results of scene detection. The proposed algorithms are robust to the distortions and artefacts found in digitized archived video.
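
To make the first stage of such a pipeline concrete, the sketch below shows a common histogram-difference baseline for hard-cut detection. It is not the detector proposed in the paper (which uses separate, specialized detectors for each transition type); the HSV histogram features, the Bhattacharyya distance, the threshold value and the input file name are all illustrative assumptions.

```python
# Minimal sketch of a histogram-based hard-cut detector (a common baseline,
# not the paper's method). Requires OpenCV (pip install opencv-python).
import cv2


def detect_hard_cuts(video_path, threshold=0.5):
    """Return frame indices where an abrupt shot transition (hard cut) is likely."""
    cap = cv2.VideoCapture(video_path)
    cuts = []
    prev_hist = None
    frame_idx = 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        # Hue/saturation histogram is less sensitive to brightness changes
        hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)
        hist = cv2.calcHist([hsv], [0, 1], None, [32, 32], [0, 180, 0, 256])
        cv2.normalize(hist, hist)
        if prev_hist is not None:
            # Bhattacharyya distance: small for similar frames, large across a cut
            dist = cv2.compareHist(prev_hist, hist, cv2.HISTCMP_BHATTACHARYYA)
            if dist > threshold:
                cuts.append(frame_idx)
        prev_hist = hist
        frame_idx += 1
    cap.release()
    return cuts


if __name__ == "__main__":
    # "archive_clip.mp4" is a hypothetical input file
    print(detect_hard_cuts("archive_clip.mp4"))
```

In a full system along the lines described in the abstract, the detected shot boundaries would feed into sub-shot segmentation and, together with the detected transition types (cut, fade, dissolve, wipe), into film-grammar-based scene grouping.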