Near-lossless semantic video summarization and its applications to video analysis

Authors:
Tao Mei;Lin-Xie Tang;Jinhui Tang;Xian-Sheng Hua
Affiliations:
Microsoft Research Asia, China;University of Science and Technology of China, China;Nanjing University of Science and Technology, China;Microsoft, USA
Venue:
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Year:
2013

Citing 34
Cited 0

Automatic partitioning of full-motion video

Multimedia Systems
An interactive comic book presentation for exploring video

Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Content-Based Image Retrieval at the End of the Early Years

IEEE Transactions on Pattern Analysis and Machine Intelligence
A user attention model for video summarization

Proceedings of the tenth ACM international conference on Multimedia
Dynamic key frame presentation techniques for augmenting video browsing

AVI '98 Proceedings of the working conference on Advanced visual interfaces
Sports video summarization using highlights and play-breaks

MIR '03 Proceedings of the 5th ACM SIGMM international workshop on Multimedia information retrieval
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Content-based multimedia information retrieval: State of the art and challenges

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Automatic summarization of music videos

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Large-Scale Concept Ontology for Multimedia

IEEE MultiMedia
The challenge problem for automated detection of 101 semantic concepts in multimedia

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Video abstraction: A systematic review and classification

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Content-driven adaptation of on-line video

Image Communication
Clever clustering vs. simple speed-up for summarizing rushes

Proceedings of the international workshop on TRECVID video summarization
Structure and event mining in sports video with efficient mosaic

Multimedia Tools and Applications
Reranking Methods for Visual Search

IEEE MultiMedia
The trecvid 2008 BBC rushes summarization evaluation

TVS '08 Proceedings of the 2nd ACM TRECVid Video Summarization Workshop
Video collage: presenting a video sequence using a single image

The Visual Computer: International Journal of Computer Graphics
Automatic personalized video abstraction for sports videos using metadata

Multimedia Tools and Applications
CrowdReranking: exploring multiple search engines for visual search reranking

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Near-lossless video summarization

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Real-time near-duplicate elimination for web video search with content and context

IEEE Transactions on Multimedia - Special issue on integration of context and content
Scale-rotation invariant pattern entropy for keypoint-based near-duplicate detection

IEEE Transactions on Image Processing
VideoSense: a contextual in-video advertising system

IEEE Transactions on Circuits and Systems for Video Technology
Scalable clip-based near-duplicate video detection with ordinal measure

Proceedings of the ACM International Conference on Image and Video Retrieval
Audio-visual atoms for generic video concept classification

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Community discovery from movie and its application to poster generation

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I
Near-Duplicate Keyframe Identification With Interest Point Matching and Pattern Learning

IEEE Transactions on Multimedia
Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study

IEEE Transactions on Multimedia
Video Annotation Through Search and Graph Reinforcement Mining

IEEE Transactions on Multimedia
A unified approach to shot change detection and camera motion characterization

IEEE Transactions on Circuits and Systems for Video Technology
Object-based video abstraction for video surveillance systems

IEEE Transactions on Circuits and Systems for Video Technology
Overview of the H.264/AVC video coding standard

IEEE Transactions on Circuits and Systems for Video Technology
Home Video Visual Quality Assessment With Spatiotemporal Factors

IEEE Transactions on Circuits and Systems for Video Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

The ever increasing volume of video content on the Web has created profound challenges for developing efficient indexing and search techniques to manage video data. Conventional techniques such as video compression and summarization strive for the two commonly conflicting goals of low storage and high visual and semantic fidelity. With the goal of balancing both video compression and summarization, this article presents a novel approach, called Near-Lossless Semantic Summarization (NLSS), to summarize a video stream with the least high-level semantic information loss by using an extremely small piece of metadata. The summary consists of compressed image and audio streams, as well as the metadata for temporal structure and motion information. Although at a very low compression rate (around ¼0; of H.264 baseline, where traditional compression techniques can hardly preserve an acceptable visual fidelity), the proposed NLSS still can be applied to many video-oriented tasks, such as visualization, indexing and browsing, duplicate detection, concept detection, and so on. We evaluate the NLSS on TRECVID and other video collections, and demonstrate that it is a powerful tool for significantly reducing storage consumption, while keeping high-level semantic fidelity.