Static saliency vs. dynamic saliency: a comparative study

Authors:
Tam V. Nguyen;Mengdi Xu;Guangyu Gao;Mohan Kankanhalli;Qi Tian;Shuicheng Yan
Affiliations:
National University of Singapore, Singapore, Singapore;National University of Singapore, Singapore, Singapore;Beijing Institute of Technology, Beijing, China;National University of Singapore, Singapore, Singapore;University of Texas at San Antonio, San Antonio, USA;National University of Singapore, Singapore, Singapore
Venue:
Proceedings of the 21st ACM international conference on Multimedia
Year:
2013

Abstract

Visual saliency has recently attracted wide attention from researchers in computer vision and multimedia. However, most saliency research has been conducted on still images and therefore studies only static saliency. In this paper, we present the first comprehensive comparative study of dynamic saliency (video shots) and static saliency (key frames of the corresponding video shots), and obtain two key observations: 1) video saliency often differs from, yet is closely related to, image saliency, and 2) camera motions, such as tilting, panning, or zooming, significantly affect dynamic saliency. Motivated by these observations, we propose a novel camera-motion- and image-saliency-aware model for dynamic saliency prediction. Extensive experiments on two static-vs-dynamic saliency datasets that we collected show that the proposed method outperforms state-of-the-art methods for dynamic saliency prediction. Finally, we apply dynamic saliency prediction to dynamic video captioning, which helps people with hearing impairments better enjoy videos that have only off-screen voices, e.g., documentary films, news videos, and sports videos.