An optimal algorithm for approximate nearest neighbor searching fixed dimensions
Journal of the ACM (JACM)
Multiple View Geometry in Computer Vision
Multiple View Geometry in Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
Photo tourism: exploring photo collections in 3D
ACM SIGGRAPH 2006 Papers
Scalable Recognition with a Vocabulary Tree
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Image Based Localization in Urban Environments
3DPVT '06 Proceedings of the Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06)
SIFT Flow: Dense Correspondence across Different Scenes
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part III
Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
ViewFocus: explore places of interests on Google maps using photos with view direction filtering
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Efficiently locating photographs in many panoramas
Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems
Retrieving landmark and non-landmark images from community photo collections
Proceedings of the international conference on Multimedia
Spatial coding for large scale partial-duplicate web image search
Proceedings of the international conference on Multimedia
Beyond GPS: determining the camera viewing direction of a geotagged image
Proceedings of the international conference on Multimedia
Location recognition using prioritized feature matching
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Accurate image localization based on google maps street view
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Geotagging in multimedia and computer vision--a survey
Multimedia Tools and Applications
The social camera: a case-study in contextual image recommendation
Proceedings of the 16th international conference on Intelligent user interfaces
Active query sensing for mobile location search
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Towards low bit rate mobile visual search with multiple-channel coding
MM '11 Proceedings of the 19th ACM international conference on Multimedia
City-scale landmark identification on mobile devices
CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Fast image-based localization using direct 2D-to-3D matching
ICCV '11 Proceedings of the 2011 International Conference on Computer Vision
Robust and accurate mobile visual localization and its applications
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) - Special Sections on the 20th Anniversary of ACM International Conference on Multimedia, Best Papers of ACM Multimedia 2012
Listen, look, and gotcha: instant video search with mobile phones by layered audio-video indexing
Proceedings of the 21st ACM international conference on Multimedia
GIANT: geo-informative attributes for location recognition and exploration
Proceedings of the 21st ACM international conference on Multimedia
Augmented and interactive video playback based on global camera pose
Proceedings of the 21st ACM international conference on Multimedia
City-view image retrieval leveraging check-in data
Proceedings of the 2nd ACM international workshop on Geotagging and its applications in multimedia
Orientation data correction with georeferenced mobile videos
Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Hi-index | 0.00 |
While on the go, more and more people are using their phones to enjoy ubiquitous location-based services (LBS). One of the fundamental problems of LBS is localization. Researchers are now investigating ways to use a phone-captured image for localization as it contains more scene context information than the embedded sensors. In this paper, we present a novel approach to mobile visual localization that accurately senses geographic scene context according to the current image (typically associated with a rough GPS position). Unlike most existing visual localization methods, the proposed approach is capable of providing a complete set of more accurate parameters about the scene geo---including the actual locations of both the mobile user and perhaps more importantly the captured scene along with the viewing direction. Our approach takes advantage of advanced techniques for large-scale image retrieval and 3D model reconstruction from photos. Specifically, we first perform joint geo-visual clustering in the cloud to generate scene clusters, with each scene represented by a 3D model. The 3D scene models are then indexed using a visual vocabulary tree structure. The phone-captured image is used to retrieve the relevant scene models, then aligned with the models, and further registered to the real-world map. Our approach achieves an estimation accuracy of user location within 14 meters, viewing direction within 9 degrees, and scene location within 21 meters. Such a complete set of accurate geo-parameters can lead to various LBS applications for routing that cannot be achieved with most existing methods. In particular, we showcase three novel applications: 1) accurate self-localization, 2) collaborative localization for rendezvous routing, and 3) routing for photographing. The evaluations through user studies indicate these applications are effective for facilitating the perfect rendezvous for mobile users.