The anatomy of a large-scale hypertextual Web search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Focused crawling: a new approach to topic-specific Web resource discovery
WWW '99 Proceedings of the eighth international conference on World Wide Web
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Breadth-first crawling yields high-quality pages
Proceedings of the 10th international conference on World Wide Web
Geospatial mapping and navigation of the web
Proceedings of the 10th international conference on World Wide Web
Accelerated focused crawling through online relevance feedback
Proceedings of the 11th international conference on World Wide Web
On the geographic location of internet resources
Proceedings of the 2nd ACM SIGCOMM Workshop on Internet measurment
Distributed Hypertext Resource Discovery Through Examples
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Computing Geographical Scopes of Web Resources
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Handbook of massive data sets
Extracting Spatial Knowledge from the Web
SAINT '03 Proceedings of the 2003 Symposium on Applications and the Internet
Web-a-where: geotagging web content
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Focused crawling for both topical relevance and quality of medical information
Proceedings of the 14th ACM international conference on Information and knowledge management
A large scale study of wireless search behavior: Google mobile search
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Geographically focused collaborative crawling
Proceedings of the 15th international conference on World Wide Web
An Introduction to Search Engines and Web Navigation
An Introduction to Search Engines and Web Navigation
Mapping and visualizing the internet
ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
Address extraction: extraction of location-based information from the web
APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
DCbot: finding spatial information on the web
DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
Retrieving address-based locations from the web
Proceedings of the 2nd international workshop on Geographic information retrieval
Adaptive geospatially focused crawling
Proceedings of the 18th ACM conference on Information and knowledge management
Hi-index | 0.00 |
Local search is increasingly becoming a major focus point of research interest. It is a widely-recognized speciality search with a large application area. Its data is usually aggregated from a variety of sources. One as yet largely untapped source of location data is the WWW. Today, the Web does not explicitly reveal its location-relation; rather this information is hidden somewhere within pages' contents. To exploit such location information, we need to find, extract and geo-spatially index relevant Web pages. For an effective retrieval of such content, this paper examines the application of focused Web crawling to the geospatial domain. We describe our approach for a geo-aware focused crawling of urban areas and other regions with a high building density. We present our experimental results that give us insight into spatial Web information such as location density and link distance between topical pages. Our crawls and evaluations back our hypothesis that geospatially focused crawling is suitable for the urban geospatial topic.