Retrieving address-based locations from the web

  • Authors:
  • Dirk Ahlers;Susanne Boll

  • Affiliations:
  • OFFIS Institute for Information Technology, Oldenburg, Germany;University of Oldenburg, Oldenburg, Germany

  • Venue:
  • Proceedings of the 2nd international workshop on Geographic information retrieval
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Geospatial search for the Web determines the relation of documents' contents to a location within a region. For some pedestrian scenarios, information at a higher granularity down to individual buildings is necessary. In this paper, we describe a process for the extraction and simultaneous verification of precise addresses on German Web pages by a validating parser. We describe how an address-level location extraction can be aided by an extensive use of previous geographic knowledge and the use of its structure. The analysis of address structure, components and dependencies leads to the design of a geoparser that determines valid addresses within unstructured Web content. We further discuss some noteworthy issues that arise within the process.