Recovering Surface Layout from an Image

Authors:
Derek Hoiem;Alexei A. Efros;Martial Hebert
Affiliations:
Robotics Institute, Carnegie Mellon University, Pittsburgh, USA 15213;Robotics Institute, Carnegie Mellon University, Pittsburgh, USA 15213;Robotics Institute, Carnegie Mellon University, Pittsburgh, USA 15213
Venue:
International Journal of Computer Vision
Year:
2007

Citing 34
Cited 51

Knowledge-based interpretation of outdoor natural color scenes

Knowledge-based interpretation of outdoor natural color scenes
A Transform for Multiscale Image Segmentation by Integrated Edge and Region Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence
Improved Boosting Algorithms Using Confidence-rated Predictions

Machine Learning - The Eleventh Annual Conference on computational Learning Theory
Normalized Cuts and Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Single View Metrology

International Journal of Computer Vision
Multiple view geometry in computer vision

Multiple view geometry in computer vision
Fast Approximate Energy Minimization via Graph Cuts

IEEE Transactions on Pattern Analysis and Machine Intelligence
Image Segmentation by Data-Driven Markov Chain Monte Carlo

IEEE Transactions on Pattern Analysis and Machine Intelligence
Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons

International Journal of Computer Vision
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope

International Journal of Computer Vision
Logistic Regression, AdaBoost and Bregman Distances

Machine Learning
Depth Estimation from Image Structure

IEEE Transactions on Pattern Analysis and Machine Intelligence
Video Compass

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
COMPUTER RECOGNITION OF THREE-DIMENSIONAL OBJECTS IN A VISUAL SCENE

COMPUTER RECOGNITION OF THREE-DIMENSIONAL OBJECTS IN A VISUAL SCENE
Self-Calibration and Metric Reconstruction in spite of Varying and Unknown Internal Camera Parameters

ICCV '98 Proceedings of the Sixth International Conference on Computer Vision
Bayesian Reconstruction of 3D Shapes and Scenes From A Single Image

HLK '03 Proceedings of the First IEEE International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis
Learning a Classification Model for Segmentation

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Towards a Mathematical Theory of Primal Sketch and Sketchability

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Efficient Graph-Based Image Segmentation

International Journal of Computer Vision
Lazy snapping

ACM SIGGRAPH 2004 Papers
Image Parsing: Unifying Segmentation, Detection, and Recognition

International Journal of Computer Vision
Generalizing Swendsen-Wang to Sampling Arbitrary Posterior Probabilities

IEEE Transactions on Pattern Analysis and Machine Intelligence
Automatic photo pop-up

ACM SIGGRAPH 2005 Papers
Geometric Context from a Single Image

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Learning Hierarchical Models of Scenes, Objects, and Parts

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Bottom-up/Top-Down Image Parsing by Attribute Graph Grammar

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Model Order Selection and Cue Combination for Image Segmentation

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
Putting Objects in Perspective

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Depth from Familiar Objects: A Hierarchical Model for 3D Scenes

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
A Dynamic Bayesian Network Model for Autonomous 3D Reconstruction from a Single Indoor Image

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Using Multiple Segmentations to Discover Objects and their Extent in Image Collections

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Boundary Extraction in Natural Images Using Ultrametric Contour Maps

CVPRW '06 Proceedings of the 2006 Conference on Computer Vision and Pattern Recognition Workshop
Probabilistic spatial context models for scene content understanding

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition

Occlusion Boundaries from Motion: Low-Level Detection and Mid-Level Reasoning

International Journal of Computer Vision
Color learning and illumination invariance on mobile robots: A survey

Robotics and Autonomous Systems
A new localized superpixel Markov random field for image segmentation

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Image-based exploration obstacle avoidance for mobile robot

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
Appearance contrast for fast, robust trail-following

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Multi-view Superpixel Stereo in Urban Environments

International Journal of Computer Vision
Superpixel analysis for object detection and tracking with application to UAV imagery

ISVC'07 Proceedings of the 3rd international conference on Advances in visual computing - Volume Part I
A nonparametric learning approach to range sensing from omnidirectional vision

Robotics and Autonomous Systems
A low false negative filter for detecting rare bird species from short video segments using a probable observation data set-based EKF method

IEEE Transactions on Image Processing
A framework for photo-quality assessment and enhancement based on visual aesthetics

Proceedings of the international conference on Multimedia
Detecting ground shadows in outdoor consumer photographs

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Discriminative learning with latent variables for cluttered indoor scene understanding

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Why did the person cross the road (there)? scene understanding using probabilistic logic models and common sense reasoning

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Blocks world revisited: image understanding using qualitative geometry and mechanics

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Discriminative learning with latent variables for cluttered indoor scene understanding

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Non-local characterization of scenery images: statistics, 3D reasoning, and a generative model

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Superparsing: scalable nonparametric image parsing with superpixels

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Category independent object proposals

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Thinking inside the box: using appearance models and context based on room geometry

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Turbopixel segmentation using Eigen-images

IEEE Transactions on Image Processing
Context modeling in computer vision: techniques, implications, and applications

Multimedia Tools and Applications
Recovering Occlusion Boundaries from an Image

International Journal of Computer Vision
PixelLaser: computing range from monocular texture

ISVC'10 Proceedings of the 6th international conference on Advances in visual computing - Volume Part III
Object class segmentation using reliable regions

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part II
Combining plane estimation with shape detection for holistic scene understanding

ACIVS'11 Proceedings of the 13th international conference on Advanced concepts for intelligent vision systems
A holistic approach to aesthetic enhancement of photographs

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) - Special section on ACM multimedia 2010 best paper candidates, and issue on social media
Purposive hidden-object game (P-HOG) towards imperceptible human computation

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Harmony Potentials

International Journal of Computer Vision
Estimating the Natural Illumination Conditions from a Single Outdoor Image

International Journal of Computer Vision
Real-time estimation of 3D scene geometry from a single image

Pattern Recognition
Evaluating visual aesthetics in photographic portraiture

CAe '12 Proceedings of the Eighth Annual Symposium on Computational Aesthetics in Graphics, Visualization, and Imaging
Orientation-aware scene understanding for mobile cameras

Proceedings of the 2012 ACM Conference on Ubiquitous Computing
Object Detection using Geometrical Context Feedback

International Journal of Computer Vision
Mobile robot 3D map building and path planning based on multi-sensor data fusion

International Journal of Computer Applications in Technology
On learning higher-order consistency potentials for multi-class pixel labeling

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Constrained semi-supervised learning using attributes and comparative attributes

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Beyond the line of sight: labeling the underlying surfaces

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Efficient exact inference for 3d indoor scene understanding

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Road scene segmentation from a single image

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VII
Joint spatio-temporal depth features fusion framework for 3d structure estimation in urban environment

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part III
Human-centric indoor environment modeling from depth videos

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume 2
Superparsing

International Journal of Computer Vision
Discriminative learning with latent variables for cluttered indoor scene understanding

Communications of the ACM
Joint kernel learning for supervised image segmentation

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Detecting changes in images of street scenes

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part IV
Depth synthesis and local warps for plausible image-based navigation

ACM Transactions on Graphics (TOG)
Render synthetic fog into interior and exterior photographs

Proceedings of the 12th ACM SIGGRAPH International Conference on Virtual-Reality Continuum and Its Applications in Industry
Probabilistic Joint Image Segmentation and Labeling by Figure-Ground Composition

International Journal of Computer Vision
Efficient semantic image segmentation with multi-class ranking prior

Computer Vision and Image Understanding
C2TAM: A Cloud framework for cooperative tracking and mapping

Robotics and Autonomous Systems
A flexible architecture for multi-view 3DTV based on uncalibrated cameras

Journal of Visual Communication and Image Representation

Quantified Score

Hi-index	0.02

Visualization

Abstract

Humans have an amazing ability to instantly grasp the overall 3D structure of a scene--ground orientation, relative positions of major landmarks, etc.--even from a single image. This ability is completely missing in most popular recognition algorithms, which pretend that the world is flat and/or view it through a patch-sized peephole. Yet it seems very likely that having a grasp of this "surface layout" of a scene should be of great assistance for many tasks, including recognition, navigation, and novel view synthesis.In this paper, we take the first step towards constructing the surface layout, a labeling of the image intogeometric classes. Our main insight is to learn appearance-based models of these geometric classes, which coarsely describe the 3D scene orientation of each image region. Our multiple segmentation framework provides robust spatial support, allowing a wide variety of cues (e.g., color, texture, and perspective) to contribute to the confidence in each geometric label. In experiments on a large set of outdoor images, we evaluate the impact of the individual cues and design choices in our algorithm. We further demonstrate the applicability of our method to indoor images, describe potential applications, and discuss extensions to a more complete notion of surface layout.