Multimodal location estimation on Flickr videos

Authors:
Gerald Friedland;Jaeyoung Choi;Howard Lei;Adam Janin
Affiliations:
International Computer Science Institute, Berkeley, CA, USA;International Computer Science Institute, Berkeley, CA, USA;International Computer Science Institute, Berkeley, CA, USA;International Computer Science Institute, Berkeley, CA, USA
Venue:
WSM '11 Proceedings of the 3rd ACM SIGMM international workshop on Social media
Year:
2011

Citing 11
Cited 4

Image Based Localization in Urban Environments

3DPVT '06 Proceedings of the Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06)
Flickr tag recommendation based on collective knowledge

Proceedings of the 17th international conference on World Wide Web
Methods for extracting place semantics from Flickr tags

ACM Transactions on the Web (TWEB)
What Does the Sky Tell Us about the Camera?

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part IV
Estimating Geo-temporal Location of Stationary Cameras Using Shadow Trajectories

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Placing flickr photos on a map

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Enhancing semantic and geographic annotation of web images via logistic canonical correlation regression

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Multimodal location estimation

Proceedings of the international conference on Multimedia
Cybercasing the joint: on the privacy implications of geo-tagging

HotSec'10 Proceedings of the 5th USENIX conference on Hot topics in security
Automatic tagging and geotagging in video collections and communities

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Methods for extracting place semantics from Flickr tags

ACM Transactions on the Web (TWEB)

WSM2011: third ACM workshop on social media

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Geo-visual ranking for location prediction of social images

Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
A study on the accuracy of Flickr's geotag data

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Human vs machine: establishing a human baseline for multimodal location estimation

Proceedings of the 21st ACM international conference on Multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

The following article describes an approach to determine the geo-coordinates of the recording place of Flickr videos based on both textual metadata and visual cues. The system is tested on the MediaEval 2010 Placing Task evaluation data, which consists of 5091 unfiltered test videos. The system presented in this article is less complex, uses less training data, and is at the same time more accurate than the best system presented in the evaluation in August 2010. The performance peaks at being able to classify 14% of the videos with less than 10m accuracy. The article describes the realization of the system, analyses of the different uses of multimodal cues and gazetteer information.