Leveraging 3D City Models for Rotation Invariant Place-of-Interest Recognition

Authors:
Georges Baatz;Kevin Köser;David Chen;Radek Grzeszczuk;Marc Pollefeys
Affiliations:
Department of Computer Science, ETH Zurich, Zurich, Switzerland;Department of Computer Science, ETH Zurich, Zurich, Switzerland;Department of Electrical Engineering, Stanford University, Stanford, USA;Nokia Research at Palo Alto, Palo Alto, USA;Department of Computer Science, ETH Zurich, Zurich, Switzerland
Venue:
International Journal of Computer Vision
Year:
2012

Citing 13
Cited 4

Use of the Hough transformation to detect lines and curves in pictures

Communications of the ACM
Video Compass

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Video Google: A Text Retrieval Approach to Object Matching in Videos

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
A Performance Evaluation of Local Descriptors

IEEE Transactions on Pattern Analysis and Machine Intelligence
Scalable Recognition with a Vocabulary Tree

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Pattern Recognition and Machine Learning (Information Science and Statistics)

Pattern Recognition and Machine Learning (Information Science and Statistics)
Image Based Localization in Urban Environments

3DPVT '06 Proceedings of the Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06)
Speeded-Up Robust Features (SURF)

Computer Vision and Image Understanding
Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Avoiding confusing features in place recognition

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Accurate image localization based on google maps street view

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Handling urban location recognition as a 2D homothetic problem

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI

Augmented Reality: Handheld Augmented Reality involving gravity measurements

Computers and Graphics
Virtual reference view generation for CBIR-based visual pose estimation

Proceedings of the 20th ACM international conference on Multimedia
Large scale visual geo-localization of images in mountainous terrain

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
A mobile indoor navigation system interface adapted to vision-based localization

Proceedings of the 11th International Conference on Mobile and Ubiquitous Multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

Given a cell phone image of a building we address the problem of place-of-interest recognition in urban scenarios. Here, we go beyond what has been shown in earlier approaches by exploiting the nowadays often available 3D building information (e.g. from extruded floor plans) and massive street-level image data for database creation. Exploiting vanishing points in query images and thus fully removing 3D rotation from the recognition problem allows then to simplify the feature invariance to a purely homothetic problem, which we show enables more discriminative power in feature descriptors than classical SIFT. We rerank visual word based document queries using a fast stratified homothetic verification that in most cases boosts the correct document to top positions if it was in the short list. Since we exploit 3D building information, the approach finally outputs the camera pose in real world coordinates ready for augmenting the cell phone image with virtual 3D information. The whole system is demonstrated to outperform traditional approaches on city scale experiments for different sources of street-level image data and a challenging set of cell phone images.