This article describes an approach for estimating the geo-coordinates at which Flickr videos were recorded, using both textual metadata and visual cues. The system is tested on the MediaEval 2010 Placing Task evaluation data, which consists of 5091 unfiltered test videos. It is less complex and uses less training data than the best system presented at the evaluation in August 2010, yet it is more accurate. At its best, it places 14% of the videos within 10 m of their true location. The article describes the implementation of the system and analyzes the different uses of multimodal cues and gazetteer information.
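
To make the accuracy figure concrete, the following minimal sketch shows one way a "share of videos placed within a given distance" score could be computed. It is an illustration only: the abstract does not say which distance formula the evaluation uses, so the haversine great-circle distance, the Earth radius constant, and the function names `haversine_m` and `fraction_within` are all assumptions of this sketch rather than parts of the described system.

```python
import math

# Mean Earth radius in metres (assumed for this sketch).
EARTH_RADIUS_M = 6_371_000.0


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    # Clamp to 1.0 to guard against floating-point overshoot before asin.
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(min(1.0, a)))


def fraction_within(predicted, truth, threshold_m):
    """Share of items whose predicted (lat, lon) lies within threshold_m of the truth."""
    if len(predicted) != len(truth):
        raise ValueError("predicted and truth must have the same length")
    if not predicted:
        return 0.0
    hits = sum(
        1
        for (plat, plon), (tlat, tlon) in zip(predicted, truth)
        if haversine_m(plat, plon, tlat, tlon) <= threshold_m
    )
    return hits / len(predicted)
```

Calling `fraction_within(predicted, truth, 10.0)` on lists of `(latitude, longitude)` pairs gives the share of items placed within 10 m; the same function can be evaluated at larger thresholds to see how accuracy changes with the allowed error.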