Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope
International Journal of Computer Vision
Bayesian Reconstruction of 3D Shapes and Scenes From A Single Image
HLK '03 Proceedings of the First IEEE International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis
ACM SIGGRAPH 2005 Papers
A Dynamic Bayesian Network Model for Autonomous 3D Reconstruction from a Single Indoor Image
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Example Based 3D Reconstruction from Single 2D Images
CVPRW '06 Proceedings of the 2006 Conference on Computer Vision and Pattern Recognition Workshop
Make3D: Learning 3D Scene Structure from a Single Still Image
IEEE Transactions on Pattern Analysis and Machine Intelligence
Beyond pixels: exploring new representations and applications for motion analysis
Beyond pixels: exploring new representations and applications for motion analysis
Robust Bilayer Segmentation and Motion/Depth Estimation with a Handheld Camera
IEEE Transactions on Pattern Analysis and Machine Intelligence
SIFT Flow: Dense Correspondence across Scenes and Its Applications
IEEE Transactions on Pattern Analysis and Machine Intelligence
StereoBrush: interactive 2D to 3D conversion using discontinuous warps
Proceedings of the Eighth Eurographics Symposium on Sketch-Based Interfaces and Modeling
Video Stereolization: Combining Motion Analysis with User Interaction
IEEE Transactions on Visualization and Computer Graphics
Depth Director: A System for Adding Depth to Movies
IEEE Computer Graphics and Applications
Learning the right model: Efficient max-margin learning in Laplacian CRFs
CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Hi-index | 0.00 |
We describe a technique that automatically generates plausible depth maps from videos using non-parametric depth sampling. We demonstrate our technique in cases where past methods fail (non-translating cameras and dynamic scenes). Our technique is applicable to single images as well as videos. For videos, we use local motion cues to improve the inferred depth maps, while optical flow is used to ensure temporal depth consistency. For training and evaluation, we use a Kinect-based system to collect a large dataset containing stereoscopic videos with known depths. We show that our depth estimation technique outperforms the state-of-the-art on benchmark databases. Our technique can be used to automatically convert a monoscopic video into stereo for 3D visualization, and we demonstrate this through a variety of visually pleasing results for indoor and outdoor scenes, including results from the feature film Charade.