Hybrid index structures for location-based web search

Authors:
Yinghua Zhou; Xing Xie; Chuang Wang; Yuchang Gong; Wei-Ying Ma
Affiliations:
University of Sci. & Tech. of China, Hefei, Anhui, P.R. China; Microsoft Research Asia, Beijing, P.R. China; Huazhong University of Sci. & Tech., Wuhan, P.R. China; University of Sci. & Tech. of China, Hefei, Anhui, P.R. China; Microsoft Research Asia, Beijing, P.R. China
Venue:
Proceedings of the 14th ACM international conference on Information and knowledge management
Year:
2005

Abstract

Commercial and research interest in location-based web search, i.e., finding web content whose topic is related to a particular place or region, is growing. In this type of search, location information should be indexed alongside text information. However, the index of a conventional text search engine is set-oriented, while location information is two-dimensional and lies in Euclidean space. This raises new research problems: how to efficiently represent the location attributes of web pages, and how to combine the two types of indexes. In this paper, we propose a hybrid index structure that integrates inverted files and R*-trees to handle both textual and location-aware queries. Three combining schemes are studied: (1) an inverted file and R*-tree double index, (2) first inverted file, then R*-tree, and (3) first R*-tree, then inverted file. To validate the performance of the proposed index structures, we design and implement a complete location-based web search engine consisting of four main parts: (1) an extractor that detects the geographical scopes of web pages and represents them as multiple MBRs (minimum bounding rectangles) based on geographical coordinates; (2) an indexer that builds hybrid index structures to integrate text and location information; (3) a ranker that ranks results by geographical relevance as well as non-geographical relevance; and (4) an interface that lets users enter location-based search queries and obtain geographically and textually relevant results. Experiments on a large real-world web dataset show that both the second and the third structures are superior in query time, with the second slightly better than the third. Additionally, indexes based on R*-trees are shown to be more efficient than indexes based on grid structures.
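As a rough illustration of combining scheme (2), first inverted file then R*-tree, the Python sketch below keeps one spatial structure per term and queries it with a rectangular window. This is not the authors' implementation: the names `SpatialList`, `TextFirstIndex`, `add`, and `query` are hypothetical, the sample documents are invented, and a simple linear scan over MBRs stands in for a real R*-tree.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Set, Tuple

# An MBR is (min_x, min_y, max_x, max_y), e.g. longitude/latitude bounds.
MBR = Tuple[float, float, float, float]


def intersects(a: MBR, b: MBR) -> bool:
    """Return True if two rectangles overlap (touching edges count)."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


class SpatialList:
    """Assumed stand-in for an R*-tree: stores (MBR, doc_id) pairs, scans linearly."""

    def __init__(self) -> None:
        self.entries: List[Tuple[MBR, int]] = []

    def insert(self, mbr: MBR, doc_id: int) -> None:
        self.entries.append((mbr, doc_id))

    def search(self, window: MBR) -> Set[int]:
        return {doc for mbr, doc in self.entries if intersects(mbr, window)}


class TextFirstIndex:
    """Scheme (2) sketch: an inverted file whose entry per term is a spatial index."""

    def __init__(self) -> None:
        self.by_term: Dict[str, SpatialList] = defaultdict(SpatialList)

    def add(self, doc_id: int, terms: List[str], mbrs: List[MBR]) -> None:
        # A page may have several MBRs; each is stored under every term it contains.
        for term in set(terms):
            for mbr in mbrs:
                self.by_term[term].insert(mbr, doc_id)

    def query(self, terms: List[str], window: MBR) -> Set[int]:
        """Documents that contain every term and overlap the query window."""
        result: Optional[Set[int]] = None
        for term in terms:
            if term not in self.by_term:
                return set()
            hits = self.by_term[term].search(window)
            result = hits if result is None else result & hits
        return result or set()


# Invented sample data for illustration only.
index = TextFirstIndex()
index.add(1, ["pizza", "delivery"], [(0.0, 0.0, 2.0, 2.0)])
index.add(2, ["pizza"], [(10.0, 10.0, 12.0, 12.0)])
index.add(3, ["museum"], [(1.0, 1.0, 3.0, 3.0)])

window = (1.5, 1.5, 5.0, 5.0)
hits = index.query(["pizza"], window)
print(sorted(hits))  # prints [1]
```

Looking up the term first narrows the spatial search to pages that already match the text, which is one plausible reason a text-first layout can answer keyword-plus-region queries quickly. Scheme (3) would invert the nesting, attaching a small inverted file to each spatial node instead.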