A close-form iterative algorithm for depth inferring from a single image

  • Authors:
  • Yang Cao;Yan Xia;Zengfu Wang

  • Affiliations:
  • Automation Department, University of Science and Technology of China, Hefei, China;Automation Department, University of Science and Technology of China, Hefei, China;Automation Department, University of Science and Technology of China, Hefei, China

  • Venue:
  • ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Inferring depth from a single image is a difficult task in computer vision, which needs to utilize adequate monocular cues contained in the image. Inspired by Saxena et al's work, this paper presents a closeform iterative algorithm to process multi-scale image segmentation and depth inferring alternately, which can significantly improve segmentation and depth estimate results. First, an EM-based algorithm is applied to obtain an initial multi-scale image segmentation result. Then, the multiscale Markov random field (MRF) model, trained by supervised learning, is used to infer both depths and the relations between depths at different image regions. Next, a graph-based region merging algorithm is applied to merge the segmentations at the larger scales by incorporating the inferred depths. At the last, the refined multi-scale image segmentations are used as input of MRF model and the depth are re-inferred. The above processes are iteratively continued until the expected results are achieved. Since there are no changes on the segmentations at the finest scale in the iterative process, it still can capture the detailed 3D structure. Meanwhile, the refined segmentations at the other scales will help obtain more global structure information in the image. The contrastive experimental results verify the validity of our method that it can infer quantitatively better depth estimations for 62.7% of 134 images downloaded from the Saxena's database. Our method can also improve the image segmentation results in the sense of scene interpretation. Moreover, the paper extends the method to estimate the depth of the scene with fore-objects.