Automatic foveation for video compression using a neurobiological model of visual attention

Authors:
L. Itti
Affiliations:
Psychol. & Neurosci. Graduate Program, Univ. of Southern California, Los Angeles, CA, USA
Venue:
IEEE Transactions on Image Processing
Year:
2004

Citing 0
Cited 50

Rapid Biologically-Inspired Scene Classification Using Features Shared with Visual Attention

IEEE Transactions on Pattern Analysis and Machine Intelligence
Attention-based similarity

Pattern Recognition
Recent advances in rate control for video coding

Image Communication
Towards efficient context-specific video coding based on gaze-tracking analysis

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Priority coding for video-telephony applications based on visual attention

MobiMedia '06 Proceedings of the 2nd international conference on Mobile multimedia communications
Perceptual image retrieval using eye movements

International Journal of Computer Mathematics - Computer Vision and Pattern Recognition
An efficient algorithm for attention-driven image interpretation from segments

Pattern Recognition
Design Principles and Constraints Underlying the Construction of Brain-Based Devices

Neural Information Processing
A generic virtual content insertion system based on visual attention analysis

MM '08 Proceedings of the 16th ACM international conference on Multimedia
A Novel Hierarchical Framework for Object-Based Visual Attention

Attention in Cognitive Systems
Spatiotemporal saliency for video classification

Image Communication
A multicue Bayesian state estimator for gaze prediction in open signed video

IEEE Transactions on Multimedia
Computational visual attention systems and their cognitive foundations: A survey

ACM Transactions on Applied Perception (TAP)
Review article: Object-based video coding with dynamic quality control

Image and Vision Computing
Effect of compressed offline foveated video on viewing behavior and subjective quality

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Perception-oriented video coding based on foveated JND model

PCS'09 Proceedings of the 27th conference on Picture Coding Symposium
Video coding based on audio-visual attention

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Cloudlet-screen computing: a multi-core-based, cloud-computing-oriented, traditional-computing-compatible parallel computing paradigm for the masses

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
What we see is most likely to be what matters: visual attention and applications

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Perceptually-friendly H.264/AVC video coding

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
A novel multiresolution spatiotemporal saliency detection model and its applications in image and video compression

IEEE Transactions on Image Processing
Foveated mean squared error--a novel video quality metric

Multimedia Tools and Applications
Accurate and efficient method for smoothly space-variant Gaussian blurring

IEEE Transactions on Image Processing
Do video coding impairments disturb the visual attention deployment?

Image Communication
Visual attention guided bit allocation in video compression

Image and Vision Computing
Attention-based video streaming

Image Communication
A novel approach to FRUC using discriminant saliency and frame segmentation

IEEE Transactions on Image Processing
A framework for error protection of region of interest coded images and videos

Image Communication
Saliency-based fidelity adaptation preprocessing for video coding

Journal of Computer Science and Technology - Special issue on natural language processing
A scheme for attentional video compression

PReMI'11 Proceedings of the 4th international conference on Pattern recognition and machine intelligence
Efficient video coding based on audio-visual focus of attention

Journal of Visual Communication and Image Representation
An attention based similarity measure for colour images

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part II
Entropy reduction of foveated DCT images

ACIVS'05 Proceedings of the 7th international conference on Advanced Concepts for Intelligent Vision Systems
Perceptual image retrieval using eye movements

IWICPAS'06 Proceedings of the 2006 Advances in Machine Vision, Image Processing, and Pattern Analysis international conference on Intelligent Computing in Pattern Analysis/Synthesis
Motion perception based adaptive quantization for video coding

PCM'05 Proceedings of the 6th Pacific-Rim conference on Advances in Multimedia Information Processing - Volume Part I
A saliency detection model based on local and global kernel density estimation

ICONIP'11 Proceedings of the 18th international conference on Neural Information Processing - Volume Part I
A novel nonparametric approach for saliency detection using multiple features

ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part II
Salient object detection: a benchmark

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Assessment of computational visual attention models on medical images

Proceedings of the Eighth Indian Conference on Computer Vision, Graphics and Image Processing
Saliency maps of high dynamic range images

ECCV'10 Proceedings of the 11th European conference on Trends and Topics in Computer Vision - Volume Part II
Learning saliency-based visual attention: A review

Signal Processing
Dynamic saliency models and human attention: a comparative study on videos

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part III
Virtual photograph based saliency analysis of high dynamic range images

Proceedings of the Symposium on Computational Aesthetics
Visual saliency detection using information divergence

Pattern Recognition
Special Section on 3D Object Retrieval: Efficient 3D object recognition using foveated point clouds

Computers and Graphics
Steganography in coloured images using wavelet domain-based saliency map

International Journal of Information and Computer Security
Spatiotemporal saliency detection and salient region determination for H.264 videos

Journal of Visual Communication and Image Representation
Saliency-Based region log covariance feature for image copy detection

IWDW'12 Proceedings of the 11th international conference on Digital Forensics and Watermaking
Visual saliency guided video compression algorithm

Image Communication
Attention-Based Health Monitoring

International Journal of Monitoring and Surveillance Technologies Research

Quantified Score

Hi-index	0.01

Visualization

Abstract

We evaluate the applicability of a biologically-motivated algorithm to select visually-salient regions of interest in video streams for multiply-foveated video compression. Regions are selected based on a nonlinear integration of low-level visual cues, mimicking processing in primate occipital, and posterior parietal cortex. A dynamic foveation filter then blurs every frame, increasingly with distance from salient locations. Sixty-three variants of the algorithm (varying number and shape of virtual foveas, maximum blur, and saliency competition) are evaluated against an outdoor video scene, using MPEG-1 and constant-quality MPEG-4 (DivX) encoding. Additional compression radios of 1.1 to 8.5 are achieved by foveation. Two variants of the algorithm are validated against eye fixations recorded from four to six human observers on a heterogeneous collection of 50 video clips (over 45 000 frames in total). Significantly higher overlap than expected by chance is found between human and algorithmic foveations. With both variants, foveated clips are, on average, approximately half the size of unfoveated clips, for both MPEG-1 and MPEG-4. These results suggest a general-purpose usefulness of the algorithm in improving compression ratios of unconstrained video.