Record-boundary discovery in Web documents
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Toward the semantic geospatial web
Proceedings of the 10th ACM international symposium on Advances in geographic information systems
Data Mining for Web Intelligence
Computer
Improving pseudo-relevance feedback in web information retrieval using web page segmentation
WWW '03 Proceedings of the 12th international conference on World Wide Web
Extensionality of the RCC8 composition table
Fundamenta Informaticae
Introduction to the special issue on the web as corpus
Computational Linguistics - Special issue on web as corpus
Named Entity recognition without gazetteers
EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Web-a-where: geotagging web content
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Toponym resolution in text (abstract only): "which sheffield is it?"
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Detecting geographic locations from web resources
Proceedings of the 2005 workshop on Geographic information retrieval
Extracting metadata for spatially-aware information retrieval on the internet
Proceedings of the 2005 workshop on Geographic information retrieval
Bootstrapping toponym classifiers
HLT-NAACL-GEOREF '03 Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1
A confidence-based framework for disambiguating geographic terms
HLT-NAACL-GEOREF '03 Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1
Mining Domain-Specific Thesauri from Wikipedia: A Case Study
WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
Extracting content structure for web pages based on visual representation
APWeb'03 Proceedings of the 5th Asia-Pacific web conference on Web technologies and applications
Determining geographic representations for arbitrary concepts at query time
Proceedings of the first international workshop on Location and the web
Acquisition of a vernacular gazetteer from web sources
Proceedings of the first international workshop on Location and the web
Automatic acquisition of vernacular places
Proceedings of the 10th International Conference on Information Integration and Web-based Applications & Services
Map-based filters for fuzzy entities in geographical information retrieval
NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
Challenges for indexing in GIR
SIGSPATIAL Special
Your mileage may vary: on the limits of social media
SIGSPATIAL Special
Learning boundaries of vague places from noisy annotations
Proceedings of the 19th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Hi-index | 0.00 |
Extracting geographical information from various web sources is likely to be important for a variety of applications. One such use for this information is to enable the study of vernacular regions: informal places referred to on a day-to-day basis, but with no official entry in geographical resources, such as gazetteers. Past work in automatically extracting geographical information from the web to support the creation of vernacular regions has tended to focus on larger regions (e.g. "The British Midlands" and "The South of France"). In this paper we report the results of preliminary work to investigate the success of using a simple geo-tagging approach and resources of varying granularity from the Ordnance Survey to extract geographical information from web pages. We find that the data gathered for smaller regions (compared with larger ones) is more "fine-grained" which has an effect on the type of resource most useful for geo-tagging and its success.