Extracting Geographic Context from the Web: GeoReferencing in MyMoSe

  • Authors:
  • Álvaro Zubizarreta;Pablo Fuente;José M. Cantera;Mario Arias;Jorge Cabrero;Guido García;César Llamas;Jesús Vegas

  • Affiliations:
  • GRINBD, Departamento de Informática, Universidad de Valladolid, Spain;GRINBD, Departamento de Informática, Universidad de Valladolid, Spain;Telefónica I+D, Parque Tecnológico Boecillo, Boecillo, Spain 47151;GRINBD, Departamento de Informática, Universidad de Valladolid, Spain;GRINBD, Departamento de Informática, Universidad de Valladolid, Spain;Telefónica I+D, Parque Tecnológico Boecillo, Boecillo, Spain 47151;GRINBD, Departamento de Informática, Universidad de Valladolid, Spain;GRINBD, Departamento de Informática, Universidad de Valladolid, Spain

  • Venue:
  • ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many Web pages are clearly related to specific locations. Identifying this geographic focus is the cornerstone of the next generation of geographic context aware search services. This paper shows a multistage method for assigning a geographic focus to Web pages (GeoReferencing), using several heuristics for toponym disambiguation and a scoring function for focus determination. We provide an experimental methodology for evaluating the accuracy of the system with Web pages in English and Spanish. Finally, we have obtained promising results, reaching an accuracy of over 70% with a town-level resolution.