Hierarchical video content description and summarization using unified semantic and visual similarity

  • Authors and affiliations:
  • Xingquan Zhu, Department of Computer Science, University of Vermont, Burlington, VT
  • Jianping Fan, Department of Computer Science, University of North Carolina, Charlotte, NC
  • Ahmed K. Elmagarmid, Department of Computer Science, Purdue University, West Lafayette, IN
  • Xindong Wu, Department of Computer Science, University of Vermont, Burlington, VT

  • Venue:
  • Multimedia Systems
  • Year:
  • 2003


Abstract

Video is increasingly the medium of choice for a variety of communication channels, driven primarily by the growth of networked multimedia systems. One way to keep our heads above the video sea is to provide summaries in a more tractable format. Many existing approaches are limited to extracting important low-level feature-related units for summarization. Unfortunately, the semantics, content, and structure of a video do not correspond directly to low-level features, even with closed captions, scene detection, and audio signal processing. The drawbacks of existing methods are the following: (1) instead of unfolding the semantics and structure within the video, low-level units usually address only details, and (2) no important-unit selection strategy based on low-level features can be applied to general videos. Providing users with an overview of the video content at various levels of summarization is essential for more efficient database retrieval and browsing. In this paper, we present a hierarchical video content description and summarization strategy supported by a novel joint semantic and visual similarity measure. To describe the video content efficiently and accurately, a video content description ontology is adopted. Various video processing techniques are then utilized to construct a semi-automatic video annotation framework. By integrating the acquired content description data, a hierarchical video content structure is constructed through group merging and clustering. Finally, a four-layer video summary with different granularities is assembled to help users unfold the video content progressively. Experiments on real-world videos have validated the effectiveness of the proposed approach.