Inverted File Partitioning Schemes in Multiple Disk Systems
IEEE Transactions on Parallel and Distributed Systems
Adding compression to a full-text retrieval system
Software—Practice & Experience
Term-weighting approaches in automatic text retrieval
Readings in information retrieval
Geospatial mapping and navigation of the web
Proceedings of the 10th international conference on World Wide Web
Proceedings of the 11th international conference on World Wide Web
Modern Information Retrieval
Computing Geographical Scopes of Web Resources
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Web-a-where: geotagging web content
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Three-level caching for efficient query processing in large Web search engines
WWW '05 Proceedings of the 14th international conference on World Wide Web
Hybrid index structures for location-based web search
Proceedings of the 14th ACM international conference on Information and knowledge management
Inverted files for text search engines
ACM Computing Surveys (CSUR)
Geographically focused collaborative crawling
Proceedings of the 15th international conference on World Wide Web
Efficient query processing in geographic web search engines
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Processing Spatial-Keyword (SK) Queries in Geographic Information Retrieval (GIR) Systems
SSDBM '07 Proceedings of the 19th International Conference on Scientific and Statistical Database Management
Introduction to Information Retrieval
Introduction to Information Retrieval
Keyword Search on Spatial Databases
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Forward Decay: A Practical Time Decay Model for Streaming Systems
ICDE '09 Proceedings of the 2009 IEEE International Conference on Data Engineering
Keyword Search in Spatial Databases: Towards Searching by Document
ICDE '09 Proceedings of the 2009 IEEE International Conference on Data Engineering
Efficient retrieval of the top-k most relevant spatial web objects
Proceedings of the VLDB Endowment
Maintaining time-decaying stream aggregates
Journal of Algorithms
Supporting location-based approximate-keyword queries
Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems
Hybrid indexing and seamless ranking of spatial and textual features of web documents
DEXA'10 Proceedings of the 21st international conference on Database and expert systems applications: Part I
Retrieving top-k prestige-based relevant spatial web objects
Proceedings of the VLDB Endowment
Spatio-textual indexing for geographical search on the web
SSTD'05 Proceedings of the 9th international conference on Advances in Spatial and Temporal Databases
Hi-index | 0.00 |
There is a significant commercial and research interest in location-based web search engines. Given a number of search keywords and one or more locations (geographical points) that a user is interested in, a location-based web search retrieves and ranks the most textually and spatially relevant web pages. In this type of search, both the spatial and textual information should be indexed. Currently, no efficient index structure exists that can handle both the spatial and textual aspects of data simultaneously and accurately. Existing approaches either index space and text separately or use inefficient hybrid index structures with poor performance and inaccurate results. Moreover, most of these approaches cannot accurately rank web-pages based on a combination of space and text and are not easy to integrate into existing search engines. In this paper, we propose a new index structure called Spatial-Keyword Inverted File for Points to handle point-based indexing of web documents in an integrated/efficient manner. To seamlessly find and rank relevant documents, we develop a new distance measure called spatial tf-idf. We propose four variants of spatial-keyword relevance scores and two algorithms to perform top-k searches. As verified by experiments, our proposed techniques outperform existing index structures in terms of search performance and accuracy.