Depth extraction from video using non-parametric sampling

Authors:
Kevin Karsch;Ce Liu;Sing Bing Kang
Affiliations:
University of Illinois at Urbana-Champaign;Microsoft Research, New England;Microsoft Research
Venue:
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
Year:
2012

Citing 13
Cited 0

Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope

International Journal of Computer Vision
Bayesian Reconstruction of 3D Shapes and Scenes From A Single Image

HLK '03 Proceedings of the First IEEE International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis
Automatic photo pop-up

ACM SIGGRAPH 2005 Papers
A Dynamic Bayesian Network Model for Autonomous 3D Reconstruction from a Single Indoor Image

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Example Based 3D Reconstruction from Single 2D Images

CVPRW '06 Proceedings of the 2006 Conference on Computer Vision and Pattern Recognition Workshop
Make3D: Learning 3D Scene Structure from a Single Still Image

IEEE Transactions on Pattern Analysis and Machine Intelligence
Beyond pixels: exploring new representations and applications for motion analysis

Beyond pixels: exploring new representations and applications for motion analysis
Robust Bilayer Segmentation and Motion/Depth Estimation with a Handheld Camera

IEEE Transactions on Pattern Analysis and Machine Intelligence
SIFT Flow: Dense Correspondence across Scenes and Its Applications

IEEE Transactions on Pattern Analysis and Machine Intelligence
StereoBrush: interactive 2D to 3D conversion using discontinuous warps

Proceedings of the Eighth Eurographics Symposium on Sketch-Based Interfaces and Modeling
Video Stereolization: Combining Motion Analysis with User Interaction

IEEE Transactions on Visualization and Computer Graphics
Depth Director: A System for Adding Depth to Movies

IEEE Computer Graphics and Applications
Learning the right model: Efficient max-margin learning in Laplacian CRFs

CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Quantified Score

Hi-index	0.00

Visualization

Abstract

We describe a technique that automatically generates plausible depth maps from videos using non-parametric depth sampling. We demonstrate our technique in cases where past methods fail (non-translating cameras and dynamic scenes). Our technique is applicable to single images as well as videos. For videos, we use local motion cues to improve the inferred depth maps, while optical flow is used to ensure temporal depth consistency. For training and evaluation, we use a Kinect-based system to collect a large dataset containing stereoscopic videos with known depths. We show that our depth estimation technique outperforms the state-of-the-art on benchmark databases. Our technique can be used to automatically convert a monoscopic video into stereo for 3D visualization, and we demonstrate this through a variety of visually pleasing results for indoor and outdoor scenes, including results from the feature film Charade.