Paged similarity queries

Authors:
Enzo Seraphim;Thatyana F. Piola Seraphim;Edmilson M. Moreira;Fábio C. M. Ricotta;Caetano Traina, Jr.
Affiliations:
Institute of Engineering of Systems and Technology Information, Federal University of Itajubá(MG), Brazil;Institute of Engineering of Systems and Technology Information, Federal University of Itajubá(MG), Brazil;Institute of Engineering of Systems and Technology Information, Federal University of Itajubá(MG), Brazil;Institute of Engineering of Systems and Technology Information, Federal University of Itajubá(MG), Brazil;Institute of Mathematics and Computing, University of São Paulo at São Carlos(SP), Brazil
Venue:
Information Sciences: an International Journal
Year:
2011

Citing 13
Cited 0

Slim-Trees: High Performance Metric Trees Minimizing Overlap Between Nodes

EDBT '00 Proceedings of the 7th International Conference on Extending Database Technology: Advances in Database Technology
Processing Incremental Multidimensional Range Queries in a Direct Manipulation Visual Query

ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
M-tree: An Efficient Access Method for Similarity Search in Metric Spaces

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Reducing the Braking Distance of an SQL Query Engine

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Query processing and optimization in Oracle Rdb

The VLDB Journal — The International Journal on Very Large Data Bases
Index-driven similarity search in metric spaces (Survey Article)

ACM Transactions on Database Systems (TODS)
Efficient and self-tuning incremental query expansion for top-k query processing

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Range Nearest-Neighbor Query

IEEE Transactions on Knowledge and Data Engineering
Efficient processing of complex similarity queries in RDBMS through query rewriting

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
A performance comparison of distance-based query algorithms using R-trees in spatial databases

Information Sciences: an International Journal
People search: Searching people sharing similar interests from the Web

Journal of the American Society for Information Science and Technology
Towards a unified approach to document similarity search using manifold-ranking of blocks

Information Processing and Management: an International Journal
A survey on session detection methods in query logs and a proposal for future evaluation

Information Sciences: an International Journal

Quantified Score

Hi-index	0.07

Visualization

Abstract

An important feature of a database management systems (DBMS) is its client/server architecture, where managing shared memory among the clients and the server is always an tough issue. However, similarity queries are specially sensitive to this kind of architecture, since the answer sizes vary widely. Usually, the answers of similarity query are fully processed to be sent in full to the user, who often is interested in just parts of the answer, e.g. just few elements closer or farther to the query reference. Compelling the DBMS to retrieve the full answer, further ignoring its majority is at least a waste of server processing power. Paging the answer is a technique that splits the answer onto several pages, following client requests. Despite the success of paging on traditional queries, little work has been done to support it in similarity queries. In this work, we present a technique that not only provides paging in similarity range or k-nearest neighbor queries, but also supports them in two variations: the forward similarity query and the backward similarity query. They return elements either increasingly farther of increasingly closer to the query reference. The reported experiments show that, depending on the proportion of the interesting part over the full answer, both techniques allow answering queries much faster than it is obtained in the non-paged way.