Geo-tagging for imprecise regions of different sizes

Authors:
Robert C. Pasley;Paul D. Clough;Mark Sanderson
Affiliations:
University of Sheffield, Sheffield, United Kingdom;University of Sheffield, Sheffield, United Kingdom;University of Sheffield, Sheffield, United Kingdom
Venue:
Proceedings of the 4th ACM workshop on Geographical information retrieval
Year:
2007

Abstract

Extracting geographical information from web sources is likely to be important for a variety of applications. One use of this information is to enable the study of vernacular regions: informal places referred to on a day-to-day basis but with no official entry in geographical resources such as gazetteers. Past work on automatically extracting geographical information from the web to support the creation of vernacular regions has tended to focus on larger regions (e.g. "The British Midlands" and "The South of France"). In this paper we report the results of preliminary work investigating how successfully a simple geo-tagging approach, combined with Ordnance Survey resources of varying granularity, extracts geographical information from web pages. We find that the data gathered for smaller regions is more "fine-grained" than that for larger ones, which affects both the type of resource most useful for geo-tagging and the success of the geo-tagging.