Object detection, shape recovery, and 3D modelling by depth-encoded hough voting

Authors:
Min Sun;Shyam Sunder Kumar;Gary Bradski;Silvio Savarese
Affiliations:
-;-;-;-
Venue:
Computer Vision and Image Understanding
Year:
2013

Citing 47
Cited 0

Recognizing solid objects by alignment with an image

International Journal of Computer Vision
Plenoptic modeling: an image-based rendering system

SIGGRAPH '95 Proceedings of the 22nd annual conference on Computer graphics and interactive techniques
Modeling and rendering architecture from photographs: a hybrid geometry- and image-based approach

SIGGRAPH '96 Proceedings of the 23rd annual conference on Computer graphics and interactive techniques
Light field rendering

SIGGRAPH '96 Proceedings of the 23rd annual conference on Computer graphics and interactive techniques
Tour into the picture: using a spidery mesh interface to make animation from a single image

Proceedings of the 24th annual conference on Computer graphics and interactive techniques
The digital Michelangelo project: 3D scanning of large statues

Proceedings of the 27th annual conference on Computer graphics and interactive techniques
Real-time 3D model acquisition

Proceedings of the 29th annual conference on Computer graphics and interactive techniques
A Theory of Shape by Space Carving

International Journal of Computer Vision - Special issue on Genomic Signal Processing
The Visual Hull Concept for Silhouette-Based Image Understanding

IEEE Transactions on Pattern Analysis and Machine Intelligence
Calibrated, Registered Images of an Extended Urban Area

International Journal of Computer Vision
Visual navigation using a single camera

ICCV '95 Proceedings of the Fifth International Conference on Computer Vision
Texture Synthesis by Non-Parametric Sampling

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Poisson image editing

ACM SIGGRAPH 2003 Papers
Sea of Images: A Dense Sampling Approach for Rendering Large Indoor Environments

IEEE Computer Graphics and Applications
Visual Modeling with a Hand-Held Camera

International Journal of Computer Vision
Modelling and Interpretation of Architecture from Several Images

International Journal of Computer Vision
The Princeton Shape Benchmark

SMI '04 Proceedings of the Shape Modeling International 2004
"GrabCut": interactive foreground extraction using iterated graph cuts

ACM SIGGRAPH 2004 Papers
High-quality video view interpolation using a layered representation

ACM SIGGRAPH 2004 Papers
Metric 3D Reconstruction and Texture Acquisition of Surfaces of Revolution from a Single Uncalibrated View

IEEE Transactions on Pattern Analysis and Machine Intelligence
A Sparse Object Category Model for Efficient Learning and Exhaustive Recognition

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Histograms of Oriented Gradients for Human Detection

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Automatic photo pop-up

ACM SIGGRAPH 2005 Papers
Partial and approximate symmetry detection for 3D geometry

ACM SIGGRAPH 2006 Papers
SmoothSketch: 3D free-form shapes from complex sketches

ACM SIGGRAPH 2006 Papers
Photo tourism: exploring photo collections in 3D

ACM SIGGRAPH 2006 Papers
A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
Single View Reconstruction of Curved Surfaces

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
3D Reconstruction by Shadow Carving: Theory and Practical Evaluation

International Journal of Computer Vision
Scene completion using millions of photographs

ACM SIGGRAPH 2007 papers
Example-based 3D scan completion

SGP '05 Proceedings of the third Eurographics symposium on Geometry processing
Groups of Adjacent Contour Segments for Object Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence
Sketching reality: Realistic interpretation of architectural designs

ACM Transactions on Graphics (TOG)
Deep photo: model-based photograph enhancement and viewing

ACM SIGGRAPH Asia 2008 papers
Seam carving for media retargeting

Communications of the ACM - Rural engineering development
View Synthesis for Recognizing Unseen Poses of Object Classes

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part III
Make3D: Learning 3D Scene Structure from a Single Still Image

IEEE Transactions on Pattern Analysis and Machine Intelligence
Using Multi-view Recognition and Meta-data Annotation to Guide a Robot's Attention

International Journal of Robotics Research
Non-parametric Single View Reconstruction of Curved Objects Using Convex Optimization

Proceedings of the 31st DAGM Symposium on Pattern Recognition
Symmetric architecture modeling with a single image

ACM SIGGRAPH Asia 2009 papers
Close-range scene segmentation and reconstruction of 3D point cloud maps for mobile manipulation in domestic environments

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Error-tolerant image compositing

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Discriminative learning with latent variables for cluttered indoor scene understanding

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Depth-encoded hough voting for joint object detection and shape recovery

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Thinking inside the box: using appearance models and context based on room geometry

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Representations and Techniques for 3D Object Recognition & Scene Interpretation

Representations and Techniques for 3D Object Recognition & Scene Interpretation
Efficient structured prediction for 3D indoor scene understanding

CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Detecting objects, estimating their pose, and recovering their 3D shape are critical problems in many vision and robotics applications. This paper addresses the above needs using a two stages approach. In the first stage, we propose a new method called DEHV - Depth-Encoded Hough Voting. DEHV jointly detects objects, infers their categories, estimates their pose, and infers/decodes objects depth maps from either a single image (when no depth maps are available in testing) or a single image augmented with depth map (when this is available in testing). Inspired by the Hough voting scheme introduced in [1], DEHV incorporates depth information into the process of learning distributions of image features (patches) representing an object category. DEHV takes advantage of the interplay between the scale of each object patch in the image and its distance (depth) from the corresponding physical patch attached to the 3D object. Once the depth map is given, a full reconstruction is achieved in a second (3D modelling) stage, where modified or state-of-the-art 3D shape and texture completion techniques are used to recover the complete 3D model. Extensive quantitative and qualitative experimental analysis on existing datasets [2-4] and a newly proposed 3D table-top object category dataset shows that our DEHV scheme obtains competitive detection and pose estimation results. Finally, the quality of 3D modelling in terms of both shape completion and texture completion is evaluated on a 3D modelling dataset containing both in-door and out-door object categories. We demonstrate that our overall algorithm can obtain convincing 3D shape reconstruction from just one single uncalibrated image.