A Model for Geographic Knowledge Extraction on Web Documents

  • Authors:
  • Cláudio Elizio Campelo;Cláudio Souza Baptista

  • Affiliations:
  • Computer Science Department, University of Campina Grande,;Computer Science Department, University of Campina Grande,

  • Venue:
  • ER '09 Proceedings of the ER 2009 Workshops (CoMoL, ETheCoM, FP-UML, MOST-ONISW, QoIS, RIGiM, SeCoGIS) on Advances in Conceptual Modeling - Challenging Perspectives
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

There is an increasing interest on doing research in the field of information retrieval which aims to incorporate new dimensions, apart from text based retrieval, to the Web search engines. Geographical Information Retrieval (GIR) aims to index Web resources using a geographic context. The process of identifying the geographic context starts with the detection of different types of geographic references associated to the documents, as for example, the occurrence of place names. This paper presents a model for detecting geographic references in Web documents based on a set of heuristics. Moreover, new concepts and methods for disambiguation of many places with the same name are addressed. Finally, a prototype was built, called GeoSEn which aimed to validate the effectiveness of the proposed model.