In this paper, we describe a methodology for estimating the geographic coverage of the web without secondary knowledge or complex geo-tagging. We randomly select toponyms from the Ordnance Survey 50K gazetteer, use them as search queries, and gather the resulting document counts from various web sources for Great Britain. The same gazetteer is then used to geo-code the results so that they can be mapped. To validate the approach and to demonstrate the effects of geo/non-geo ambiguity (a place name that is also an ordinary word or a personal name) and geo/geo ambiguity (a name shared by several places), we mapped the selected toponyms to Geograph, a community project that collects user-generated, geo-tagged photographs of the UK. Although success varies with resolution, the approach appears reliable enough for applications that explore the geographic coverage of the web in cases where references to settlements are likely to be common. In our case, we applied the method to produce maps of web coverage for a range of sources at a resolution of 30 km.
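The sampling, counting, and gridding steps described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that gazetteer entries are `(name, easting, northing)` tuples with coordinates in metres, and that document counts come from a caller-supplied function, here called `count_documents`, standing in for whichever web source is queried. All function and parameter names are hypothetical.

```python
import random
from collections import defaultdict


def sample_toponyms(gazetteer, k, seed=0):
    """Draw k gazetteer entries uniformly at random, without replacement."""
    rng = random.Random(seed)  # fixed seed so a sample can be reproduced
    return rng.sample(gazetteer, k)


def grid_cell(easting, northing, cell_size_m=30_000):
    """Map a coordinate in metres to the index of its square grid cell."""
    return (int(easting // cell_size_m), int(northing // cell_size_m))


def coverage_by_cell(gazetteer, k, count_documents, cell_size_m=30_000, seed=0):
    """Sum document counts of sampled toponyms per grid cell.

    count_documents is a hypothetical callable that takes a toponym and
    returns the number of documents a web source reports for it.
    """
    totals = defaultdict(int)
    for name, easting, northing in sample_toponyms(gazetteer, k, seed):
        totals[grid_cell(easting, northing, cell_size_m)] += count_documents(name)
    return dict(totals)
```

Passing the count function in as an argument keeps the sketch independent of any particular search service, and the default cell size of 30,000 m matches the 30 km resolution mentioned above. The sketch does nothing to resolve ambiguous toponyms, so counts for names that are also common words, or that occur in several places, would be inflated.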