Multimedia multimodal geocoding

Authors:
Lin Tzy Li;Daniel Carlos Guimarães Pedronette;Jurandy Almeida;Otávio A. B. Penatti;Rodrigo Tripodi Calumby;Ricardo da S. Torres
Affiliations:
University of Campinas (UNICAMP), Campinas, SP -- Brazil and Telecommunications Res. & Dev. Center, CPqD Foundation, Campinas, SP -- Brazil;University of Campinas (UNICAMP), Campinas, SP -- Brazil;University of Campinas (UNICAMP), Campinas, SP -- Brazil;University of Campinas (UNICAMP), Campinas, SP -- Brazil;University of Campinas (UNICAMP), Campinas, SP -- Brazil and University of Feira de Santana (UEFS), Feira de Santana, BA -- Brazil;University of Campinas (UNICAMP), Campinas, SP -- Brazil
Venue:
Proceedings of the 20th International Conference on Advances in Geographic Information Systems
Year:
2012

Citing 11
Cited 2

Introduction to Information Retrieval

Introduction to Information Retrieval
Geographical information retrieval

International Journal of Geographical Information Science
Learning to rank for content-based image retrieval

Proceedings of the international conference on Multimedia information retrieval
Geotagging in multimedia and computer vision--a survey

Multimedia Tools and Applications
VIRaL: Visual Image Retrieval and Localization

Multimedia Tools and Applications
Finding locations of flickr resources using language models and similarity search

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Automatic tagging and geotagging in video collections and communities

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Exploiting clustering approaches for image re-ranking

Journal of Visual Languages and Computing
A hierarchical, multi-modal approach for placing videos on the map using millions of Flickr photographs

SBNMA '11 Proceedings of the 2011 ACM workshop on Social and behavioural networked media access
Learning to associate relevant photos to georeferenced textual documents

Proceedings of the 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
A visual approach for video geocoding using bag-of-scenes

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval

Domain-specific image geocoding: a case study on Virginia tech building photos

Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries
Tag configuration matcher for geo-tagging

Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

This work is developed in the context of the placing task of the MediaEval 2011 initiative. The objective is to geocode (or geotag) a set of videos, i.e., automatically assign geographical coordinates to them. This paper presents an architecture for multimodal geocoding that exploits both visual and textual descriptions associated with videos. This work also describes our efforts regarding the implementation of this architecture to demonstrate its applicability. Conducted experiments show how our multimodal approach enhances the results compared to relying on a single modality.