Multimedia multimodal geocoding

  • Authors:
  • Lin Tzy Li; Daniel Carlos Guimarães Pedronette; Jurandy Almeida; Otávio A. B. Penatti; Rodrigo Tripodi Calumby; Ricardo da S. Torres

  • Affiliations:
  • University of Campinas (UNICAMP), Campinas, SP -- Brazil and Telecommunications Res. & Dev. Center, CPqD Foundation, Campinas, SP -- Brazil; University of Campinas (UNICAMP), Campinas, SP -- Brazil; University of Campinas (UNICAMP), Campinas, SP -- Brazil; University of Campinas (UNICAMP), Campinas, SP -- Brazil; University of Campinas (UNICAMP), Campinas, SP -- Brazil and University of Feira de Santana (UEFS), Feira de Santana, BA -- Brazil; University of Campinas (UNICAMP), Campinas, SP -- Brazil

  • Venue:
  • Proceedings of the 20th International Conference on Advances in Geographic Information Systems
  • Year:
  • 2012

Abstract

This work was developed in the context of the Placing Task of the MediaEval 2011 initiative. The objective is to geocode (or geotag) a set of videos, i.e., to automatically assign geographical coordinates to them. The paper presents an architecture for multimodal geocoding that exploits both visual and textual descriptions associated with the videos. It also describes an implementation of this architecture that demonstrates its applicability. Experiments show that the multimodal approach improves results compared with relying on a single modality.
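
The abstract does not say how the two modalities are combined, so the following is only a minimal sketch of one plausible setup, not the authors' method. It assumes each modality yields a similarity score between a test video and every training video with known coordinates, fuses the two rankings by summed rank, and assigns the coordinates of the best fused match. The function names (`ranks`, `geocode`, `haversine_km`), the summed-rank fusion, and the toy data are all hypothetical; the great-circle distance is included as an assumed way to measure geocoding error.

```python
from math import radians, sin, cos, asin, sqrt

def haversine_km(a, b):
    """Great-circle distance in kilometres between two (lat, lon) pairs in degrees."""
    lat1, lon1, lat2, lon2 = map(radians, (a[0], a[1], b[0], b[1]))
    h = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(h))  # 6371 km: mean Earth radius

def ranks(scores):
    """Map each training-video id to its rank (0 = most similar) under one modality."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return {vid: r for r, vid in enumerate(ordered)}

def geocode(visual_scores, textual_scores, known_locations):
    """Fuse two similarity rankings by summed rank (an assumed fusion rule)
    and return the coordinates of the best-ranked training video."""
    rv, rt = ranks(visual_scores), ranks(textual_scores)
    # Ties are broken by video id so the result is deterministic.
    best = min(known_locations, key=lambda vid: (rv[vid] + rt[vid], vid))
    return known_locations[best]

# Hypothetical training videos with known (lat, lon) coordinates.
known_locations = {"a": (-22.9, -47.06), "b": (48.85, 2.35), "c": (40.71, -74.0)}
# Hypothetical similarity scores of one test video to each training video.
visual_scores = {"a": 0.9, "b": 0.8, "c": 0.1}
textual_scores = {"a": 0.1, "b": 0.9, "c": 0.5}

predicted = geocode(visual_scores, textual_scores, known_locations)
```

In this toy case the visual modality alone would pick video `a`, while the fused ranking picks `b`, which both modalities rank highly. `haversine_km(predicted, ground_truth)` could then be used to score the prediction against a known location.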