A well-known problem in photogrammetry and computer vision is the precise and robust determination of camera poses with respect to a given 3D model. In this work we propose a novel multi-modal method for single-image camera pose estimation with respect to 3D models that carry intensity information (e.g., LiDAR data with reflectance values). We utilize a direct point-based rendering approach to generate synthetic 2D views from the 3D datasets in order to bridge the dimensionality gap. The proposed method then establishes 2D/2D point and local region correspondences based on a novel self-similarity distance measure. Correct correspondences are robustly identified by searching for small regions whose local self-similarities share a similar geometric relationship, using a Generalized Hough Transform. After backprojecting the generated features into 3D, a standard Perspective-n-Point (PnP) problem is solved to yield an initial camera pose, which is then accurately refined using an intensity-based 2D/3D registration approach. An evaluation on visible and infrared (Vis/IR) 2D imagery and on airborne and terrestrial 3D datasets shows that the proposed method is applicable to a wide range of sensor types. In addition, the approach outperforms standard global multi-modal 2D/3D registration approaches based on Mutual Information in both robustness and speed. Potential applications are widespread and include, for instance, multi-spectral texturing of 3D models, SLAM, sensor data fusion, multi-spectral camera calibration, and super-resolution.
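To illustrate the pose-recovery step described above, the following is a minimal sketch, not the paper's implementation: once 2D/3D feature correspondences have been established, the Perspective-n-Point problem can be solved, for example, with a simple Direct Linear Transform (DLT). The intrinsics, pose, and synthetic 3D points below are hypothetical stand-ins for the backprojected features; a real pipeline would wrap this in RANSAC and refine the result, as the paper does with intensity-based 2D/3D registration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical ground-truth camera: intrinsics K and pose [R | t]
K = np.array([[800.0,   0.0, 320.0],
              [  0.0, 800.0, 240.0],
              [  0.0,   0.0,   1.0]])
a = 0.1  # small rotation about the z-axis
R = np.array([[np.cos(a), -np.sin(a), 0.0],
              [np.sin(a),  np.cos(a), 0.0],
              [0.0,        0.0,       1.0]])
t = np.array([0.2, -0.1, 5.0])
P_true = K @ np.hstack([R, t[:, None]])  # 3x4 projection matrix

# Synthetic 3D points (stand-ins for backprojected feature points)
X = rng.uniform(-1.0, 1.0, size=(12, 3))
Xh = np.hstack([X, np.ones((12, 1))])    # homogeneous coordinates

# Project to 2D with the ground-truth camera
x = (P_true @ Xh.T).T
x = x[:, :2] / x[:, 2:3]

# DLT: each correspondence contributes two rows to A p = 0,
# where p stacks the 12 entries of the unknown projection matrix
rows = []
for (u, v), Xi in zip(x, Xh):
    rows.append(np.concatenate([Xi, np.zeros(4), -u * Xi]))
    rows.append(np.concatenate([np.zeros(4), Xi, -v * Xi]))
A = np.array(rows)

# The solution is the right singular vector of the smallest singular value
_, _, Vt = np.linalg.svd(A)
P_est = Vt[-1].reshape(3, 4)

# Reproject with the estimated matrix and measure the error
x_est = (P_est @ Xh.T).T
x_est = x_est[:, :2] / x_est[:, 2:3]
reproj_err = float(np.max(np.abs(x_est - x)))
```

On noise-free correspondences the DLT recovers the projection matrix up to scale, so the reprojection error is near machine precision; with real, noisy multi-modal matches a robust estimator and nonlinear refinement are needed.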