A close-form iterative algorithm for depth inferring from a single image

Authors:
Yang Cao;Yan Xia;Zengfu Wang
Affiliations:
Automation Department, University of Science and Technology of China, Hefei, China;Automation Department, University of Science and Technology of China, Hefei, China;Automation Department, University of Science and Technology of China, Hefei, China
Venue:
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Year:
2010

Citing 19
Cited 0

Direct computation of shape cues using scale-adapted spatial derivative operators

International Journal of Computer Vision - Special issue: machine vision research at the Royal Institute of Technology
Computing Local Surface Orientation and Shape from Texture forCurved Surfaces

International Journal of Computer Vision
Tour into the picture: using a spidery mesh interface to make animation from a single image

Proceedings of the 24th annual conference on Computer graphics and interactive techniques
Shape from Shading: A Survey

IEEE Transactions on Pattern Analysis and Machine Intelligence
Computer Vision: A Modern Approach

Computer Vision: A Modern Approach
A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms

International Journal of Computer Vision
Geotensity: Combining Motion and Lighting for 3D Surface Reconstruction

International Journal of Computer Vision
Blobworld: Image Segmentation Using Expectation-Maximization and Its Application to Image Querying

IEEE Transactions on Pattern Analysis and Machine Intelligence
Efficient Graph-Based Image Segmentation

International Journal of Computer Vision
Automatic photo pop-up

ACM SIGGRAPH 2005 Papers
Geometric Context from a Single Image

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
High speed obstacle avoidance using monocular vision and reinforcement learning

ICML '05 Proceedings of the 22nd international conference on Machine learning
Depth from Familiar Objects: A Hierarchical Model for 3D Scenes

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
A Dynamic Bayesian Network Model for Autonomous 3D Reconstruction from a Single Indoor Image

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
3-D Depth Reconstruction from a Single Still Image

International Journal of Computer Vision
Putting Objects in Perspective

International Journal of Computer Vision
Make3D: Learning 3D Scene Structure from a Single Still Image

IEEE Transactions on Pattern Analysis and Machine Intelligence
Make3D: depth perception from a single still image

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Depth estimation using monocular and stereo cues

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

Inferring depth from a single image is a difficult task in computer vision, which needs to utilize adequate monocular cues contained in the image. Inspired by Saxena et al's work, this paper presents a closeform iterative algorithm to process multi-scale image segmentation and depth inferring alternately, which can significantly improve segmentation and depth estimate results. First, an EM-based algorithm is applied to obtain an initial multi-scale image segmentation result. Then, the multiscale Markov random field (MRF) model, trained by supervised learning, is used to infer both depths and the relations between depths at different image regions. Next, a graph-based region merging algorithm is applied to merge the segmentations at the larger scales by incorporating the inferred depths. At the last, the refined multi-scale image segmentations are used as input of MRF model and the depth are re-inferred. The above processes are iteratively continued until the expected results are achieved. Since there are no changes on the segmentations at the finest scale in the iterative process, it still can capture the detailed 3D structure. Meanwhile, the refined segmentations at the other scales will help obtain more global structure information in the image. The contrastive experimental results verify the validity of our method that it can infer quantitatively better depth estimations for 62.7% of 134 images downloaded from the Saxena's database. Our method can also improve the image segmentation results in the sense of scene interpretation. Moreover, the paper extends the method to estimate the depth of the scene with fore-objects.