When users need to find something on the Web that is related to a place, they typically submit place names to a search engine along with other keywords. However, automatic recognition of geographic characteristics embedded in Web documents, which would allow for a better connection between documents and places, remains a difficult task. We propose an ontology-driven approach to facilitate the process of recognizing, extracting, and geocoding partial or complete references to places embedded in text. Our approach combines an extraction ontology with urban gazetteers and geocoding techniques. This ontology, called OnLocus, is used to guide the discovery of geospatial evidence from the contents of Web pages. We show that addresses and positioning expressions, along with fragments such as postal codes or telephone area codes, provide satisfactory support for local search applications, since they make it possible to approximate the physical location of services and activities named within Web pages. Our experiments show the feasibility of performing automated address extraction and geocoding to identify locations associated with Web pages. Combining location identifiers with basic addresses improved the precision of extractions and reduced the number of false positives.
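To make the idea of extracting address fragments and resolving them against a gazetteer concrete, here is a minimal Python sketch. It is not the OnLocus implementation. The regular expressions, the `Evidence` type, the `extract_evidence` and `geocode` functions, and every gazetteer entry are assumptions invented for illustration, and a real system would need patterns suited to each country's address formats.

```python
import re
from typing import NamedTuple, Optional


class Evidence(NamedTuple):
    kind: str   # "postal_code" or "area_code"
    value: str  # the captured identifier


# Hypothetical patterns (assumed formats, not taken from the paper):
# a postal code written as NNNNN-NNN and a phone number written as (NN) NNNN-NNNN.
PATTERNS = {
    "postal_code": re.compile(r"\b(\d{5})-\d{3}\b"),
    "area_code": re.compile(r"\((\d{2})\)\s*\d{4}-\d{4}"),
}

# Toy gazetteer with invented entries: (kind, value) -> (place, lat, lon).
GAZETTEER = {
    ("postal_code", "12345"): ("Example District", 10.0, 20.0),
    ("area_code", "99"): ("Example City", 10.1, 20.1),
}


def extract_evidence(text: str) -> list:
    """Collect every postal-code prefix and area code found in the text."""
    found = []
    for kind, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            found.append(Evidence(kind, match.group(1)))
    return found


def geocode(evidence: list) -> Optional[tuple]:
    """Return the gazetteer entry for the most specific evidence available.

    Postal codes are tried before area codes because they are assumed
    to cover a smaller region.
    """
    for kind in ("postal_code", "area_code"):
        for ev in evidence:
            if ev.kind == kind and (ev.kind, ev.value) in GAZETTEER:
                return GAZETTEER[(ev.kind, ev.value)]
    return None


page = "Visit us at 100 Sample Street, 12345-678. Phone: (99) 5555-0000."
evidence = extract_evidence(page)
location = geocode(evidence)
print(evidence)
print(location)
```

When a page contains only a phone number, the sketch falls back to the coarser area-code entry, which mirrors the idea that partial references can still yield an approximate location.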