Structure from Motion: Beyond the Epipolar Constraint

  • Authors:
  • Tomáš Brodský, Cornelia Fermüller, Yiannis Aloimonos

  • Affiliations:
  • Computer Vision Laboratory, Center for Automation Research, University of Maryland, College Park, MD 20742-3275, USA. Emails: tbr@philabs.research.philips.com, fer@cfar.umd.edu, yiannis@cfar.umd.edu

  • Venue:
  • International Journal of Computer Vision
  • Year:
  • 2000

Abstract

The classic approach to structure from motion entails a clear separation between motion estimation and structure estimation, and between two-dimensional (2D) and three-dimensional (3D) information. For the recovery of the rigid transformation between different views, only 2D image measurements are used. To have enough information available, most existing techniques are based on the intermediate computation of optical flow, which, however, poses a problem at the locations of depth discontinuities. If we knew where the depth discontinuities were, we could (using a multitude of approaches based on smoothness constraints) accurately estimate flow values for image patches corresponding to smooth scene patches; but to know the discontinuities requires solving the structure from motion problem first. This paper introduces a novel approach to structure from motion which addresses the processes of smoothing, 3D motion estimation, and structure estimation in a synergistic manner. It provides an algorithm for estimating the transformation between two views obtained by either a calibrated or an uncalibrated camera. The results of the estimation are then utilized to perform a reconstruction of the scene from a short sequence of images.

The technique is based on constraints on image derivatives which involve the 3D motion and shape of the scene, leading to a geometric and statistical estimation problem. The interaction between 3D motion and shape allows us to estimate the 3D motion while at the same time segmenting the scene. If we use a wrong 3D motion estimate to compute depth, we obtain a distorted version of the depth function. The distortion, however, is such that the worse the motion estimate, the more likely we are to obtain depth estimates that vary locally more than the correct ones. Since local variability of depth is due either to the existence of a discontinuity or to a wrong 3D motion estimate, being able to differentiate between these two cases provides the correct motion, which yields the "least varying" estimated depth as well as the image locations of scene discontinuities. We analyze the new constraints, show their relationship to the minimization of the epipolar constraint, and present experimental results using real image sequences that indicate the robustness of the method.
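The abstract's central idea, selecting the 3D motion whose implied depth map is locally "least varying", can be sketched compactly. Note that the paper itself operates on constraints from image derivatives (normal flow), whereas the sketch below simplifies by assuming a full optical flow field is available; it uses the standard perspective flow equations for a rigid motion, and all function names (`rotational_flow`, `inverse_depth`, `depth_variability`, `select_motion`) are hypothetical, not the authors' code.

```python
import numpy as np

# Minimal sketch (not the paper's implementation): for each candidate rigid
# motion (t, omega), derotate the flow, recover inverse depth along the
# translational flow direction, and score the candidate by how much the
# resulting depth varies within small image patches. The correct motion
# should yield the locally least-varying depth, up to true discontinuities.

def rotational_flow(x, y, omega, f):
    """Flow induced purely by rotation omega = (alpha, beta, gamma)."""
    a, b, g = omega
    u = a * x * y / f - b * (f + x**2 / f) + g * y
    v = a * (f + y**2 / f) - b * x * y / f - g * x
    return u, v

def inverse_depth(x, y, flow_u, flow_v, t, omega, f):
    """Per-pixel least-squares 1/Z for a candidate motion (t, omega).

    Translational flow is (1/Z) * (x*W - f*U, y*W - f*V), so projecting the
    derotated flow onto that direction gives the inverse depth directly.
    """
    U, V, W = t
    ru, rv = rotational_flow(x, y, omega, f)
    ax, ay = x * W - f * U, y * W - f * V
    num = ax * (flow_u - ru) + ay * (flow_v - rv)
    den = ax**2 + ay**2 + 1e-12  # guard against the epipole, where A -> 0
    return num / den

def depth_variability(inv_depth, win=5):
    """Sum of local variances of inverse depth over win x win patches."""
    rows, cols = inv_depth.shape
    total = 0.0
    for i in range(0, rows - win, win):
        for j in range(0, cols - win, win):
            total += np.var(inv_depth[i:i + win, j:j + win])
    return total

def select_motion(x, y, flow_u, flow_v, candidates, f):
    """Pick the candidate (t, omega) whose depth map varies least locally."""
    best, best_cost = None, np.inf
    for t, omega in candidates:
        z_inv = inverse_depth(x, y, flow_u, flow_v, t, omega, f)
        cost = depth_variability(z_inv)
        if cost < best_cost:
            best, best_cost = (t, omega), cost
    return best, best_cost
```

In a usage scenario, `x` and `y` would be pixel-coordinate grids from `np.meshgrid`, `flow_u`/`flow_v` a precomputed flow field, and `candidates` a sampled set of translation directions and rotations. A brute-force candidate search is used here only for clarity; the paper instead formulates the selection as a geometric and statistical estimation problem and relates it to minimizing the epipolar constraint.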