Parallel pipelined filter ordering with precedence constraints

Authors:
Amol Deshpande;Lisa Hellerstein
Affiliations:
University of Maryland College Park, MD;Polytechnic Institute of NYU, Brooklyn, NY
Venue:
ACM Transactions on Algorithms (TALG)
Year:
2012

Citing 20
Cited 2

On the optimal nesting order for computing N-relational joins

ACM Transactions on Database Systems (TODS)
Theory of linear and integer programming

Theory of linear and integer programming
Join queries with external text sources: execution and optimization techniques

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Eddies: continuously adaptive query processing

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
WSQ/DSQ: a practical approach for combined querying of databases and the Web

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
The Throughput of Sequential Testing

Proceedings of the 8th International IPCO Conference on Integer Programming and Combinatorial Optimization
Optimization of Nonrecursive Queries

VLDB '86 Proceedings of the 12th International Conference on Very Large Data Bases
Linear Programming

Operations Research
Efficient information gathering on the Internet

FOCS '96 Proceedings of the 37th Annual Symposium on Foundations of Computer Science
Adaptive ordering of pipelined stream filters

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Approximating Min Sum Set Cover

Algorithmica
Exploiting Correlated Attributes in Acquisitional Query Processing

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Learning with attribute costs

Proceedings of the thirty-seventh annual ACM symposium on Theory of computing
Operator placement for in-network stream query processing

Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Flow algorithms for two pipelined filter ordering problems

Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Query optimization over web services

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
A generic flow algorithm for shared filter ordering problems

Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Algorithms for distributional and adversarial pipelined filter ordering problems

ACM Transactions on Algorithms (TALG)
Flow Algorithms for Parallel Query Optimization

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
On the ε-perturbation method for avoiding degeneracy

Operations Research Letters

Max-throughput for (conservative) k-of-n testing

ISAAC'11 Proceedings of the 22nd international conference on Algorithms and Computation
Scheduling linear chain streaming applications on heterogeneous systems with failures

Future Generation Computer Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

In the parallel pipelined filter ordering problem, we are given a set of n filters that run in parallel. The filters need to be applied to a stream of elements, to determine which elements pass all filters. Each filter has a rate limit ri on the number of elements it can process per unit time, and a selectivity pi, which is the probability that a random element will pass the filter. The goal is to maximize throughput. This problem appears naturally in a variety of settings, including parallel query optimization in databases and query processing over Web services. We present an O(n3) algorithm for this problem, given tree-structured precedence constraints on the filters. This extends work of Condon et al. [2009] and Kodialam [2001], who presented algorithms for solving the problem without precedence constraints. Our algorithm is combinatorial and produces a sparse solution. Motivated by join operators in database queries, we also give algorithms for versions of the problem in which “filter” selectivities may be greater than or equal to 1. We prove a strong connection between the more classical problem of minimizing total work in sequential filter ordering (A), and the parallel pipelined filter ordering problem (B). More precisely, we prove that A is solvable in polynomial time for a given class of precedence constraints if and only if B is as well. This equivalence allows us to show that B is NP-Hard in the presence of arbitrary precedence constraints (since A is known to be NP-Hard in that setting).