Stereoscopic visual attention-based regional bit allocation optimization for multiview video coding

  • Authors:
  • Yun Zhang;Gangyi Jiang;Mei Yu;Ken Chen;Qionghai Dai

  • Affiliations:
  • Faculty of Information Science and Engineering, Ningbo University, Ningbo, China and Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China;Faculty of Information Science and Engineering, Ningbo University, Ningbo, China;Faculty of Information Science and Engineering, Ningbo University, Ningbo, China;Faculty of Information Science and Engineering, Ningbo University, Ningbo, China;Broadband Networks & Digital Media Lab, Tsinghua University, Beijing, China

  • Venue:
  • EURASIP Journal on Advances in Signal Processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a Stereoscopic Visual Attention- (SVA-) based regional bit allocation optimization for Multiview Video Coding (MVC) by the exploiting visual redundancies from human perceptions. We propose a novel SVA model, where multiple perceptual stimuli including depth, motion, intensity, color, and orientation contrast are utilized, to simulate the visual attention mechanisms of human visual system with stereoscopic perception. Then, a semantic region-of-interest (ROI) is extracted based on the saliency maps of SVA. Both objective and subjective evaluations of extracted ROIs indicated that the proposed SVA model based on ROI extraction scheme outperforms the schemes only using spatial or/and temporal visual attention clues. Finally, by using the extracted SVA-based ROIs, a regional bit allocation optimization scheme is presented to allocate more bits on SVA-based ROIs for high image quality and fewer bits on background regions for efficient compression purpose. Experimental results on MVC show that the proposed regional bit allocation algorithm can achieve over 20 ∼ 30% bit-rate saving while maintaining the subjective image quality. Meanwhile, the image quality of ROIs is improved by 0.46 ∼ 0.61 dB at the cost of insensitive image quality degradation of the background image.