Spatio-temporal visual distortion and rate optimization for video coding

  • Authors:
  • Fangzhen Hu;Li Su;Honggang Qi;Qingming Huang

  • Affiliations:
  • Graduate University of Chinese Academy of Sciences, Beijing, China;Graduate University of Chinese Academy of Sciences, Beijing, China;Graduate University of Chinese Academy of Sciences, Beijing, China;Graduate University of Chinese Academy of Sciences, Beijing, China

  • Venue:
  • PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Rate-distortion optimization (RDO) plays a significant role in video coding. However, in most RDO methods, the distortion measurement metrics consider only the spatial distortion of statistical pixel errors. People have concerns about not only the information of independent pixels, but also the spatial and temporal correlations between them. In order to make the distortion assessment more consistent with human perception, temporal information of the successive images and the characteristics of human visual perception should be considered as well. In this paper, we propose a rate-distortion model based on spatio-temporal video structural similarity (stVSSIM) index, which takes both spatial and temporal visual quality into account. Meanwhile, to obtain a reasonable trade-off between bit-rate and visual quality dynamically, a perceptual adaptive Lagrange multiplier selection method is presented. Simulation results show that the proposed method averagely reduces 20% bit-rate under the equal visual quality and the adaptive Lagrange multiplier can further improve the results.