Communications of the ACM
Geospatial mapping and navigation of the web
Proceedings of the 10th international conference on World Wide Web
Spatial information retrieval and geographical ontologies an overview of the SPIRIT project
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Computing Geographical Scopes of Web Resources
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Named Entity recognition without gazetteers
EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Web-a-where: geotagging web content
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
The SPIRIT collection: an overview of a large web collection
ACM SIGIR Forum
HLT-NAACL-GEOREF '03 Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1
Bootstrapping toponym classifiers
HLT-NAACL-GEOREF '03 Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1
A confidence-based framework for disambiguating geographic terms
HLT-NAACL-GEOREF '03 Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references - Volume 1
Geographic information retrieval in a mobile environment: evaluating the needs of mobile individuals
Journal of Information Science
Towards a context model driven german geo-tagging system
Proceedings of the 4th ACM workshop on Geographical information retrieval
Geo-tagging for imprecise regions of different sizes
Proceedings of the 4th ACM workshop on Geographical information retrieval
Visualising the south yorkshire floods of '07
Proceedings of the 4th ACM workshop on Geographical information retrieval
Proceedings of the first international workshop on Location and the web
Annotating and visualizing location data in geospatial web applications
Proceedings of the first international workshop on Location and the web
International Journal of Geographical Information Science
Incorporating place name extents into geo-ir ranking
Proceedings of the 17th ACM conference on Information and knowledge management
Defining imprecise regions using the web
Proceedings of the 2nd PhD workshop on Information and knowledge management
Extracting geographic features from the Internet to automatically build detailed regional gazetteers
International Journal of Geographical Information Science
Data mining of maps and their automatic region-time-theme classification
SIGSPATIAL Special
Geographic information retrieval to suit immediate surroundings
Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Proceedings of the 6th Workshop on Geographic Information Retrieval
Geographic information retrieval by topological, geographical, and conceptual matching
GeoS'07 Proceedings of the 2nd international conference on GeoSpatial semantics
Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
Map-based filters for fuzzy entities in geographical information retrieval
NLDB'11 Proceedings of the 16th international conference on Natural language processing and information systems
GeoCLEF: the CLEF 2005 cross-language geographic information retrieval track overview
CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
Every document has a geographical scope
Data & Knowledge Engineering
CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval
Improving vertical geo/geo disambiguation by increasing geographical feature weights of places
Proceedings of the 2012 ACM Research in Applied Computation Symposium
International Journal of Handheld Computing Research
Hi-index | 0.00 |
This paper presents methods used to extract geospatial information from web pages for use in SPIRIT, a new Geographic Information Retrieval (GIR) system for the web. The resulting geospatial markup tools have been used to annotate around 900,000 web pages taken from a 1TB web crawl, focused on regions in the UK, France, Germany and Switzerland. This paper discusses a versatile geo-parsing tool for extracting spatial metadata based upon the GATE Information Extraction (IE) system, and a simple geo-coding program based on default sense to assign spatial coordinates to extracted locations. A preliminary analysis of markup accuracy for geo-parsing and geo-coding is provided, and an initial statistical and geographical analysis of the SPIRIT collection presented.