A generic framework of user attention model and its application in video summarization

Authors:
Yu-Fei Ma;Xian-Sheng Hua;Lie Lu;Hong-Jiang Zhang
Affiliations:
Microsoft Res. Asia, Beijing, China;-;-;-
Venue:
IEEE Transactions on Multimedia
Year:
2005

Citing 0
Cited 57

Tiling slideshow

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Audiovisual slideshow: present your journey by photos

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Content-driven adaptation of on-line video

Image Communication
Efficient spatiotemporal-attention-driven shot matching

Proceedings of the 15th international conference on Multimedia
Real-time tracking of visually attended objects in interactive virtual environments

Proceedings of the 2007 ACM symposium on Virtual reality software and technology
Video summarisation: A conceptual framework and survey of the state of the art

Journal of Visual Communication and Image Representation
Visual islands: intuitive browsing of visual search results

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Automatic creation and evaluation of MPEG-7 compliant summary descriptions for generic audiovisual content

Image Communication
An efficient algorithm for attention-driven image interpretation from segments

Pattern Recognition
A generic virtual content insertion system based on visual attention analysis

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Attention-from-motion: A factorization approach for detecting attention objects in motion

Computer Vision and Image Understanding
A User Experience Model for Home Video Summarization

MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
Modelling Spatio-Temporal Saliency to Predict Gaze Direction for Short Videos

International Journal of Computer Vision
An integrated approach to summarization and adaptation using H.264/MPEG-4 SVC

Image Communication
Movie story intensity representation through audiovisual tempo analysis

Multimedia Tools and Applications
Spatiotemporal saliency for video classification

Image Communication
On the use of hierarchical prediction structures for efficient summary generation of H.264/AVC bitstreams

Image Communication
Multi-video synopsis for video representation

Signal Processing
Evolving virtual contents with interactions in videos

IMCE '09 Proceedings of the 1st international workshop on Interactive multimedia for consumer electronics
Evolution-based virtual content insertion

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Robust region-of-interest determination based on user attention model through visual rhythm analysis

IEEE Transactions on Circuits and Systems for Video Technology
Content-based attention ranking using visual and contextual attention model for baseball videos

IEEE Transactions on Multimedia - Special issue on integration of context and content
A novel video summarization based on mining the story-structure and semantic relations among concept entities

IEEE Transactions on Multimedia - Special issue on integration of context and content
Hierarchical modeling and adaptive clustering for real-time summarization of rush videos

IEEE Transactions on Multimedia
Surveillance Audio Attention Model Based on Spatial Audio Cues

PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Content-based hierarchical motion description for multiple video adaptation

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Spatial-temporal video browsing for mobile environment based on visual attention analysis

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Taxonomy of directing semantics for film shot classification

IEEE Transactions on Circuits and Systems for Video Technology
Motion attention based frame-level bit allocation scheme for H.264

Proceedings of the First International Conference on Internet Multimedia Computing and Service
Vlogging: A survey of videoblogging technology on the web

ACM Computing Surveys (CSUR)
Denoising saliency map for region of interest extraction

VISUAL'07 Proceedings of the 9th international conference on Advances in visual information systems
Aesthetics-based automatic home video skimming system

MMM'08 Proceedings of the 14th international conference on Advances in multimedia modeling
A comparative study on attention-based rate adaptation for scalable video coding

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Temporal salient graph for sports event detection

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Probabilistic Multi-Task Learning for Visual Saliency Estimation in Video

International Journal of Computer Vision
Activity-driven content adaptation for effective video summarization

Journal of Visual Communication and Image Representation
Bridging low-level features and high-level semantics via fMRI brain imaging for video classification

Proceedings of the international conference on Multimedia
Character-based movie summarization

Proceedings of the international conference on Multimedia
Video summarization with visual and semantic features

PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Human-centered attention models for video summarization

International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction
Stereoscopic visual attention-based regional bit allocation optimization for multiview video coding

EURASIP Journal on Advances in Signal Processing
Perceptual visual quality metrics: A survey

Journal of Visual Communication and Image Representation
Musical slideshow: boosting user experience in photo presentation

Multimedia Tools and Applications
The role of attractiveness in web image search

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Key frame extraction based on visual attention model

Journal of Visual Communication and Image Representation
Video summarization with semantic concept preservation

Proceedings of the 10th International Conference on Mobile and Ubiquitous Multimedia
Stereoscopic visual attention model for 3d video

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Attention prediction in egocentric video using motion and visual saliency

PSIVT'11 Proceedings of the 5th Pacific Rim conference on Advances in Image and Video Technology - Volume Part I
Salient object detection: a benchmark

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Efficient visual attention based framework for extracting key frames from videos

Image Communication
A novel video salient object extraction method based on visual attention

Image Communication
Video abstraction based on the visual attention model and online clustering

Image Communication
The co-attention model for tiny activity analysis

Neurocomputing
A novel H.264 rate control algorithm with consideration of visual attention

Multimedia Tools and Applications
Dynamic saliency models and human attention: a comparative study on videos

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part III
Spatiotemporal saliency detection and salient region determination for H.264 videos

Journal of Visual Communication and Image Representation
'Mind the gap': evaluating user physiological response for multi-genre video summarisation

BCS-HCI '13 Proceedings of the 27th International BCS Human Computer Interaction Conference

Quantified Score

Hi-index	0.00

Visualization

Abstract

Due to the information redundancy of video, automatically extracting essential video content is one of key techniques for accessing and managing large video library. In this paper, we present a generic framework of a user attention model, which estimates the attentions viewers may pay to video contents. As human attention is an effective and efficient mechanism for information prioritizing and filtering, user attention model provides an effective approach to video indexing based on importance ranking. In particular, we define viewer attention through multiple sensory perceptions, i.e. visual and aural stimulus as well as partly semantic understanding. Also, a set of modeling methods for visual and aural attentions are proposed. As one of important applications of user attention model, a feasible solution of video summarization, without fully semantic understanding of video content as well as complex heuristic rules, is implemented to demonstrate the effectiveness, robustness, and generality of the user attention model. The promising results from the user study on video summarization indicate that the user attention model is an alternative way to video understanding.