This paper presents a framework for automatic video region-of-interest determination based on a visual attention model. We view this work as a preliminary step toward high-level semantic video analysis. To address this challenging problem, we combine video attention features with knowledge from computational media aesthetics. The three visual attention features we use are intensity, color, and motion. Guided by aesthetic principles, these features are fused according to the camera motion type, on the basis of a newly proposed video analysis unit, the frame-segment. Subjective experiments on several kinds of video data demonstrate the effectiveness of the proposed framework.
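The fusion step described above can be sketched as a weighted combination of the three normalized attention maps, with weights selected by the detected camera motion type. This is a minimal illustrative sketch, not the paper's actual method: the weight values, the motion categories, and the thresholded-bounding-box ROI extraction below are all assumptions chosen for clarity.

```python
import numpy as np

# Hypothetical per-camera-motion weights for the three attention
# features (intensity, color, motion). The paper derives its fusion
# rule from media-aesthetics principles; these numbers are placeholders.
MOTION_WEIGHTS = {
    "static": (0.4, 0.4, 0.2),   # little camera motion: favor appearance cues
    "pan":    (0.25, 0.25, 0.5), # panning: object motion is more informative
    "zoom":   (0.3, 0.3, 0.4),
}

def fuse_attention(intensity, color, motion, camera_motion="static"):
    """Fuse three normalized attention maps into one saliency map."""
    w_i, w_c, w_m = MOTION_WEIGHTS[camera_motion]
    fused = w_i * intensity + w_c * color + w_m * motion
    return fused / max(fused.max(), 1e-8)  # renormalize to [0, 1]

def region_of_interest(saliency, threshold=0.5):
    """Bounding box (x0, y0, x1, y1) of pixels above the threshold."""
    ys, xs = np.nonzero(saliency >= threshold)
    if ys.size == 0:
        return None
    return (xs.min(), ys.min(), xs.max(), ys.max())
```

In practice the maps would be computed per frame-segment rather than per frame, and the weights would follow the camera motion classification for that segment.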