Supporting top-k join queries in relational databases

Authors:
Ihab F. Ilyas;Walid G. Aref;Ahmed K. Elmagarmid
Affiliations:
School of Computer Science, University of Waterloo, Canada;Department of Computer Sciences, Purdue University, USA;Purdue University, USA
Venue:
The VLDB Journal — The International Journal on Very Large Data Bases
Year:
2004

Citing 19
Cited 64

Optimization of parallel query execution plans in XPRS

Distributed and Parallel Databases - Selected papers from the first international conference on parallel and distributed information systems
Dataflow query execution in a parallel main-memory environment

Distributed and Parallel Databases - Selected papers from the first international conference on parallel and distributed information systems
On saying “Enough already!” in SQL

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
PREDATOR: an OR-DBMS with enhanced data types

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Ripple joins for online aggregation

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Combining fuzzy information from multiple systems

Journal of Computer and System Sciences
Rank aggregation methods for the Web

Proceedings of the 10th international conference on World Wide Web
Optimal aggregation algorithms for middleware

PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Minimal probing: supporting expensive predicates for top-k queries

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Top-k selection queries over relational databases: Mapping strategies and performance evaluation

ACM Transactions on Database Systems (TODS)
Access path selection in a relational database management system

SIGMOD '79 Proceedings of the 1979 ACM SIGMOD international conference on Management of data
Reducing the Braking Distance of an SQL Query Engine

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Optimizing Multi-Feature Queries for Image Databases

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Dynamic Pipeline Scheduling for Improving Interactive Query Performance

Proceedings of the 27th International Conference on Very Large Data Bases
Supporting Incremental Join Queries on Ranked Inputs

Proceedings of the 27th International Conference on Very Large Data Bases
Query Processing Issues in Image(Multimedia) Databases

ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Towards Efficient Multi-Feature Queries in Heterogeneous Environments

ITCC '01 Proceedings of the International Conference on Information Technology: Coding and Computing
Evaluating Top-k Queries over Web-Accessible Databases

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Joining ranked inputs in practice

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases

Rank-aware query optimization

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Constrained optimalities in query personalization

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Efficient and self-tuning incremental query expansion for top-k query processing

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
RankSQL: supporting ranking queries in relational database management systems

VLDB '05 Proceedings of the 31st international conference on Very large data bases
The Indiana Center for Database Systems at Purdue University

ACM SIGMOD Record
Performance of query processing implementations in ranking-based text retrieval systems using inverted indices

Information Processing and Management: an International Journal
Database support for matching: limitations and opportunities

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Continuous monitoring of top-k queries over sliding windows

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Answering top-k queries using views

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
IO-Top-k: index-access optimized top-k query processing

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Adaptive rank-aware query optimization in relational databases

ACM Transactions on Database Systems (TODS)
Spark: top-k keyword query in relational databases

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Efficient top-k processing in large-scaled distributed environments

Data & Knowledge Engineering
Relational link-based ranking

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Depth estimation for ranking query optimization

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Anytime measures for top-k algorithms

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Supporting time-constrained SQL queries in oracle

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Region clustering based evaluation of multiple top-N selection queries

Data & Knowledge Engineering
Probabilistic ranked queries in uncertain databases

EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Evaluating rank joins with optimal cost

Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
A survey of top-k query processing techniques in relational database systems

ACM Computing Surveys (CSUR)
Joining the results of heterogeneous search engines

Information Systems
Anytime measures for top-k algorithms on exact and fuzzy data sets

The VLDB Journal — The International Journal on Very Large Data Bases
Depth estimation for ranking query optimization

The VLDB Journal — The International Journal on Very Large Data Bases
Probabilistic Inverse Ranking Queries over Uncertain Data

DASFAA '09 Proceedings of the 14th International Conference on Database Systems for Advanced Applications
Finding the K highest-ranked answers in a distributed network

Computer Networks: The International Journal of Computer and Telecommunications Networking
Robust and efficient algorithms for rank join evaluation

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Ranking Approximate Answers to Semantic Web Queries

ESWC 2009 Heraklion Proceedings of the 6th European Semantic Web Conference on The Semantic Web: Research and Applications
Answering linear optimization queries with an approximate stream index

Knowledge and Information Systems
Engineering search computing applications: vision and challenges

Proceedings of the the 7th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
Optimal algorithms for evaluating rank joins in database systems

ACM Transactions on Database Systems (TODS)
Finding Top-k Approximate Answers to Path Queries

FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
Processing top-N relational queries by learning

Journal of Intelligent Information Systems
Ranking genes based on kernels

Intelligent Decision Technologies - Special issue on advances in medical intelligent decision support systems
Probabilistic ranking over relations

Proceedings of the 13th International Conference on Extending Database Technology
Relational algebra for ranked tables with similarities: properties and implementation

IDA'07 Proceedings of the 7th international conference on Intelligent data analysis
Expressive and flexible access to web-extracted data: a keyword-based structured query language

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Computing the top-k maximal answers in a join of ranked lists

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Proximity rank join

Proceedings of the VLDB Endowment
Building ranked mashups of unstructured sources with uncertain information

Proceedings of the VLDB Endowment
Probabilistic inverse ranking queries in uncertain databases

The VLDB Journal — The International Journal on Very Large Data Bases
Providing built-in keyword search capabilities in RDBMS

The VLDB Journal — The International Journal on Very Large Data Bases
The rank join problem

Search computing
Proximity rank join in search computing

Search computing
Uncertainty in rank join

Search computing
Run-time adaptivity for search computing

Search computing
Score-consistent algebraic optimization of full-text search queries with GRAFT

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Exploiting correlation to rank database query results

DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications: Part II
A survey on representation, composition and application of preferences in database systems

ACM Transactions on Database Systems (TODS)
Parallel data access for multiway rank joins

ICWE'11 Proceedings of the 11th international conference on Web engineering
A general top-k algorithm for web data sources

DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part I
PIQL: success-tolerant query processing in the cloud

Proceedings of the VLDB Endowment
Top-k skyline: a unified approach

OTM'05 Proceedings of the 2005 OTM Confederated international conference on On the Move to Meaningful Internet Systems
Proximity measures for rank join

ACM Transactions on Database Systems (TODS)
Chapter 11: rank-join algorithms for search computing

Search Computing
On optimality-ratio and coverage in ranking of joined search results

Distributed and Parallel Databases
Top-k linked data query processing

ESWC'12 Proceedings of the 9th international conference on The Semantic Web: research and applications
Being picky: processing top-k queries with set-defined selections

Proceedings of the 21st ACM international conference on Information and knowledge management
A preference-aware query model for data web services

ER'12 Proceedings of the 31st international conference on Conceptual Modeling
Pragmatic correlation analysis for probabilistic ranking over relational data

Expert Systems with Applications: An International Journal
Computing immutable regions for subspace top-k queries

Proceedings of the VLDB Endowment
Determining the relative accuracy of attributes

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Shortlisting top-K assignments

Proceedings of the 25th International Conference on Scientific and Statistical Database Management
Provisional reporting for rank joins

Journal of Intelligent Information Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Ranking queries, also known as top-k queries, produce results that are ordered on some computed score. Typically, these queries involve joins, where users are usually interested only in the top-k join results. Top-k queries are dominant in many emerging applications, e.g., multimedia retrieval by content, Web databases, data mining, middlewares, and most information retrieval applications. Current relational query processors do not handle ranking queries efficiently, especially when joins are involved. In this paper, we address supporting top-k join queries in relational query processors. We introduce a new rank-join algorithm that makes use of the individual orders of its inputs to produce join results ordered on a user-specified scoring function. The idea is to rank the join results progressively during the join operation. We introduce two physical query operators based on variants of ripple join that implement the rank-join algorithm. The operators are nonblocking and can be integrated into pipelined execution plans. We also propose an efficient heuristic designed to optimize a top-k join query by choosing the best join order. We address several practical issues and optimization heuristics to integrate the new join operators in practical query processors. We implement the new operators inside a prototype database engine based on PREDATOR. The experimental evaluation of our approach compares recent algorithms for joining ranked inputs and shows superior performance.