Multi-modal, multi-resource methods for placing Flickr videos on the map

Authors:
Pascal Kelm;Sebastian Schmiedeke;Thomas Sikora
Affiliations:
Technische Universität Berlin, Germany;Technische Universität Berlin, Germany;Technische Universität Berlin, Germany
Venue:
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Year:
2011

Citing 10
Cited 8

Support-Vector Networks

Machine Learning
Unsupervised learning by probabilistic latent semantic analysis

Machine Learning
Introduction to MPEG-7: Multimedia Content Description Interface

Introduction to MPEG-7: Multimedia Content Description Interface
A geo-coding service encompassing a geo-parsing tool and integrated digital gazetteer service

HLT-NAACL-GEOREF '03 Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1
World-scale mining of objects and events from community photo collections

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Lire: lucene image retrieval: an extensible java CBIR library

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Mapping the world's photos

Proceedings of the 18th international conference on World wide web
Placing flickr photos on a map

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Extracting Geospatial Entities from Wikipedia

ICSC '09 Proceedings of the 2009 IEEE International Conference on Semantic Computing
Automatic tagging and geotagging in video collections and communities

Proceedings of the 1st ACM International Conference on Multimedia Retrieval

Automatic tagging and geotagging in video collections and communities

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Geo-Location estimation of flickr images: social web based enrichment

ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
A visual approach for video geocoding using bag-of-scenes

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Placing images on the world map: a microblog-based enrichment approach

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Multimodal geo-tagging in social media websites using hierarchical spatial segmentation

Proceedings of the 5th ACM SIGSPATIAL International Workshop on Location-Based Social Networks
@Phillies Tweeting from Philly? Predicting Twitter User Locations with Spatial Word Usage

ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)
A study on the accuracy of Flickr's geotag data

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
A novel fusion method for integrating multiple modalities and knowledge for multimodal location estimation

Proceedings of the 2nd ACM international workshop on Geotagging and its applications in multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present three approaches for placing videos in Flickr on the world map. The toponym extraction and geo lookup approach makes use of external resources to identify toponyms in the metadata and associate them with geo-coordinates. The metadata-based region model approach uses a k-nearest-neighbour classifier trained over geographical regions. Videos are represented using their metadata in a text space with reduced dimensionality. The visual region model approach uses a support vector machine also trained over geographical regions. Videos are represented using low-level feature vectors from multiple key frames. Voting methods are used to form a single decision for each video. We compare the approaches experimentally, highlighting the importance of using appropriate metadata features and suitable regions as the basis of the region model. The best performance is achieved by the geo-lookup approach used with fallback to the visual region model when the video metadata contains no toponym.