A novel video summarization based on mining the story-structure and semantic relations among concept entities

Authors:
Bo-Wei Chen;Jia-Ching Wang;Jhing-Fa Wang
Affiliations:
Department of Electrical Engineering, National Cheng Kung University, Tainan, Taiwan;Department of Electrical Engineering, National Cheng Kung University, Tainan, Taiwan;Department of Electrical Engineering, National Cheng Kung University, Tainan, Taiwan
Venue:
IEEE Transactions on Multimedia - Special issue on integration of context and content
Year:
2009

Citing 40
Cited 15

A maximum entropy approach to natural language processing

Computational Linguistics
New enhancements to cut, fade, and dissolve detection processes in video segmentation

MULTIMEDIA '00 Proceedings of the eighth ACM international conference on Multimedia
Combined-media video tracking for summarization

MULTIMEDIA '01 Proceedings of the ninth ACM international conference on Multimedia
Keyframe-Based User Interfaces for Digital Video

Computer
A user attention model for video summarization

Proceedings of the tenth ACM international conference on Multimedia
X-means: Extending K-means with Efficient Estimation of the Number of Clusters

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Constructing table-of-content for videos

Multimedia Systems - Special section on video libraries
Generating Hypermedia Documents from Transcriptions of Television Programs Using Parallel Text Alignment

RIDE '98 Proceedings of the Workshop on Research Issues in Database Engineering
Exploring Video Structure Beyond The Shots

ICMCS '98 Proceedings of the IEEE International Conference on Multimedia Computing and Systems
Hierarchical video content description and summarization using unified semantic and visual similarity

Multimedia Systems
Retrieval effectiveness of an ontology-based model for information selection

The VLDB Journal — The International Journal on Very Large Data Bases
MMSS: Multi-Modal Story-Oriented Video Summarization

ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
Annotation-based multimedia summarization and translation

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Designing an intelligent user interface for instructional video indexing and browsing

Proceedings of the 11th international conference on Intelligent user interfaces
Keyword-based document clustering

AsianIR '03 Proceedings of the sixth international workshop on Information retrieval with Asian languages - Volume 11
Discovering important nodes through graph entropy the case of Enron email database

Proceedings of the 3rd international workshop on Link discovery
Automatic Video Annotation by Mining Speech Transcripts

CVPRW '06 Proceedings of the 2006 Conference on Computer Vision and Pattern Recognition Workshop
2006 Special Issue: Modeling attention to salient proto-objects

Neural Networks
A study on automatically extracted keywords in text categorization

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
SenseRelate::TargetWord: a generalized framework for word sense disambiguation

ACLdemo '05 Proceedings of the ACL 2005 on Interactive poster and demonstration sessions
Adaptive estimated maximum-entropy distribution model

Information Sciences: an International Journal
Modeling personal and social network context for event annotation in images

Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
The effect of text in storyboards for video navigation

ICASSP '01 Proceedings of the Acoustics, Speech, and Signal Processing, 2001. on IEEE International Conference - Volume 03
Visual Mining of Multi-Modal Social Networks at Different Abstraction Levels

IV '07 Proceedings of the 11th International Conference Information Visualization
VAST MM: multimedia browser for presentation video

Proceedings of the 6th ACM international conference on Image and video retrieval
Generating comprehensible summaries of rushes sequences based on robust feature matching

Proceedings of the international workshop on TRECVID video summarization
Video rushes summarization by adaptive acceleration and stacking of shots

Proceedings of the international workshop on TRECVID video summarization
An integrated statistical model for multimedia evidence combination

Proceedings of the 15th international conference on Multimedia
Supporting video library exploratory search: when storyboards are not enough

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Introduction to Information Retrieval

Introduction to Information Retrieval
Graph connectivity measures for unsupervised word sense disambiguation

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Spectral structuring of home videos

CIVR'03 Proceedings of the 2nd international conference on Image and video retrieval
Unpacking meaning from words: a context-centered approach to computational lexicon design

CONTEXT'03 Proceedings of the 4th international and interdisciplinary conference on Modeling and using context
A generic framework of user attention model and its application in video summarization

IEEE Transactions on Multimedia
Automated video program summarization using speech transcripts

IEEE Transactions on Multimedia
Video summarization and scene detection by graph modeling

IEEE Transactions on Circuits and Systems for Video Technology
MINMAX optimal video summarization

IEEE Transactions on Circuits and Systems for Video Technology
Information theory-based shot cut/fade detection and video summarization

IEEE Transactions on Circuits and Systems for Video Technology
Clip-based similarity measure for query-dependent clip retrieval and video summarization

IEEE Transactions on Circuits and Systems for Video Technology
A Multiple Visual Models Based Perceptive Analysis Framework for Multilevel Video Summarization

IEEE Transactions on Circuits and Systems for Video Technology

VSUMM: A mechanism designed to produce static video summaries and a novel evaluation method

Pattern Recognition Letters
A novel video thumbnail extraction method using spatiotemporal vector quantization

Proceedings of the 3rd international workshop on Automated information extraction in media production
A content-based rapid video playback method using motion-based video time density function and temporal quantization

Proceedings of the 2010 ACM workshop on Social, adaptive and personalized multimedia interaction and access
Video summarization with visual and semantic features

PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Hierarchical keyframe-based video summarization using QR-decomposition and modified k-means clustering

EURASIP Journal on Advances in Signal Processing
Video summarization via transferrable structured learning

Proceedings of the 20th international conference on World wide web
Beyond search: Event-driven summarization for web videos

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
A smart video player with content-based fast-forward playback

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Dynamic social network for narrative video analysis

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Finding the game flow from sports video

J-MRE '11 Proceedings of the 2011 joint ACM workshop on Modeling and representing events
Video summarization with semantic concept preservation

Proceedings of the 10th International Conference on Mobile and Ubiquitous Multimedia
Using eye-tracking data for automatic film comic creation

Proceedings of the Symposium on Eye Tracking Research and Applications
Video search and indexing with reinforcement agent for interactive multimedia services

ACM Transactions on Embedded Computing Systems (TECS) - Special issue on embedded systems for interactive multimedia services (ES-IMS)
Content-Based Keyframe Clustering Using Near Duplicate Keyframe Identification

International Journal of Multimedia Data Engineering & Management
Medical Video Summarization using Central Tendency-Based Shot Boundary Detection

International Journal of Computer Vision and Image Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Video summarization techniques have been proposed for years to offer people comprehensive understanding of the whole story in the video. Roughly speaking, existing approaches can be classified into the two types: one is static storyboard, and the other is dynamic skimming. However, despite that these traditional methods give brief summaries for users, they still do not provide with a concept-organized and systematic view. In this paper, we present a structural video content browsing system and a novel summarization method by utilizing the four kinds of entities: who, what, where, and when to establish the framework of the video contents. With the assistance of the above-mentioned indexed information, the structure of the story can be built up according to the characters, the things, the places, and the time. Therefore, users can not only browse the video efficiently but also focus on what they are interested in via the browsing interface. In order to construct the fundamental system, we employ maximum entropy criterion to integrate visual and text features extracted from video frames and speech transcripts, generating high-level concept entities. A novel concept expansion method is introduced to explore the associations among these entities. After constructing the relational graph, we exploit graph entropy model to detect meaningful shots and relations, which serve as the indices for users. The results demonstrate that our system can achieve better performance and information coverage.