ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Video Google: A Text Retrieval Approach to Object Matching in Videos
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
A Performance Evaluation of Local Descriptors
IEEE Transactions on Pattern Analysis and Machine Intelligence
Scalable Recognition with a Vocabulary Tree
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Pattern Recognition and Machine Learning (Information Science and Statistics)
Pattern Recognition and Machine Learning (Information Science and Statistics)
Image Based Localization in Urban Environments
3DPVT '06 Proceedings of the Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06)
Speeded-Up Robust Features (SURF)
Computer Vision and Image Understanding
Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Accurate content-based video copy detection with efficient feature indexing
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Active query sensing for mobile location search
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Sorting unorganized photo sets for urban reconstruction
Graphical Models
Leveraging 3D City Models for Rotation Invariant Place-of-Interest Recognition
International Journal of Computer Vision
Videoscapes: exploring sparse, unstructured video collections
ACM Transactions on Graphics (TOG) - SIGGRAPH 2012 Conference Proceedings
Active query sensing: Suggesting the best query view for mobile visual search
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) - Special section of best papers of ACM multimedia 2011, and special section on 3D mobile multimedia
A memory efficient discriminative approach for location aided recognition
ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume Part I
Epipolar geometry estimation for urban scenes with repetitive structures
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part IV
Coupled structure-from-motion and 3D symmetry detection for urban facades
ACM Transactions on Graphics (TOG)
Hi-index | 0.00 |
We address the problem of large scale place-of-interest recognition in cell phone images of urban scenarios. Here, we go beyond what has been shown in earlier approaches by exploiting the nowadays often available 3D building information (e.g. from extruded floor plans) and massive street-view like image data for database creation. Exploiting vanishing points in query images and thus fully removing 3D rotation from the recognition problem allows then to simplify the feature invariance to a pure homothetic problem, which we show leaves more discriminative power in feature descriptors than classical SIFT. We rerank visual word based document queries using a fast stratified homothetic verification that is tailored for repetitive patterns like window grids on facades and in most cases boosts the correct document to top positions if it was in the short list. Since we exploit 3D building information, the approach finally outputs the camera pose in real world coordinates ready for augmenting the cell phone image with virtual 3D information. The whole system is demonstrated to outperform traditional approaches on city scale experiments for different sources of street-view like image data and a challenging set of cell phone images.