Dataflow query execution in a parallel main-memory environment

Authors:
Annita N. Wilschut;Peter M. G. Apers
Affiliations:
-;-
Venue:
PDIS '91 Proceedings of the first international conference on Parallel and distributed information systems
Year:
1991

Citing 0
Cited 79

On optimal processor allocation to support pipelined hash joins

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
On parallel execution of multiple pipelined hash joins

SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
Parallel evaluation of multi-join queries

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Online aggregation

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
On saying “Enough already!” in SQL

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Parallel Execution of Hash Joins in Parallel Databases

IEEE Transactions on Parallel and Distributed Systems
Incremental distance join algorithms for spatial databases

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Ripple joins for online aggregation

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
An adaptive query execution system for data integration

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Eddies: continuously adaptive query processing

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Dataflow plan execution for software agents

AGENTS '00 Proceedings of the fourth international conference on Autonomous agents
The state of the art in distributed query processing

ACM Computing Surveys (CSUR)
Continuously adaptive continuous queries over streams

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Query processing of streamed XML data

Proceedings of the eleventh international conference on Information and knowledge management
Informix under CONTROL: Online Query Processing

Data Mining and Knowledge Discovery
Join and multi-join processing in data integration systems

Data & Knowledge Engineering
Continuous queries over data streams

ACM SIGMOD Record
Parallel query processing with zigzag trees

The VLDB Journal — The International Journal on Very Large Data Bases - Parallelism in database systems
PRISMA/DB: A Parallel, Main Memory Relational DBMS

IEEE Transactions on Knowledge and Data Engineering
Applying Segmented Right-Deep Trees to Pipelining Multiple Hash Joins

IEEE Transactions on Knowledge and Data Engineering
Query Rewriting for SWIFT (First) Answers

IEEE Transactions on Knowledge and Data Engineering
Control Versus Data Flow in Parallel Database Machines

IEEE Transactions on Parallel and Distributed Systems
A Unified Peer-to-Peer Database Framework for Scalable Service and Resource Discovery

GRID '02 Proceedings of the Third International Workshop on Grid Computing
Online Feedback for Nested Aggregate Queries with Multi-Threading

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Dynamic Pipeline Scheduling for Improving Interactive Query Performance

Proceedings of the 27th International Conference on Very Large Data Bases
Using Segmented Right-Deep Trees for the Execution of Pipelined Hash Joins

VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
Parallelism in a Main-Memory DBMS: The Performance of PRISMA/DB

VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
Implementation and Performance Evaluation of a Parallel Transitive Closure Algorithm on PRISMA/DB

VLDB '93 Proceedings of the 19th International Conference on Very Large Data Bases
Complex Queries in DHT-based Peer-to-Peer Networks

IPTPS '01 Revised Papers from the First International Workshop on Peer-to-Peer Systems
Efficient Querying of Distributed Resources in Mediator Systems

On the Move to Meaningful Internet Systems, 2002 - DOA/CoopIS/ODBASE 2002 Confederated International Conferences DOA, CoopIS and ODBASE 2002
Support for Mobile Location-Aware Applications in MAGNET

Revised Papers from the NODe 2002 Web and Database-Related Workshops on Web, Web-Services, and Database Systems
An XML query engine for network-bound data

The VLDB Journal — The International Journal on Very Large Data Bases
Progressive evaluation of nested aggregate queries

The VLDB Journal — The International Journal on Very Large Data Bases
Exploiting Punctuation Semantics in Continuous Data Streams

IEEE Transactions on Knowledge and Data Engineering
Issues in data stream management

ACM SIGMOD Record
Chain: operator scheduling for memory minimization in data stream systems

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Adaptive filters for continuous queries over distributed data streams

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Quality of service in an information economy

ACM Transactions on Internet Technology (TOIT)
PSoup: a system for streaming queries over streaming data

The VLDB Journal — The International Journal on Very Large Data Bases
Hash-Merge Join: A Non-blocking Join Algorithm for Producing Fast and Early Join Results

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Static optimization of conjunctive queries with sliding windows over infinite streams

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
SINA: scalable incremental processing of continuous queries in spatio-temporal databases

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Optimization of data stream processing

ACM SIGMOD Record
Operator scheduling in data stream systems

The VLDB Journal — The International Journal on Very Large Data Bases
RankSQL: query algebra and optimization for relational top-k queries

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Early hash join: a configurable algorithm for the efficient and early production of join results

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Adaptive query processing in mobile environment

MPAC '05 Proceedings of the 3rd international workshop on Middleware for pervasive and ad-hoc computing
Safety guarantee of continuous join queries over punctuated data streams

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Efficient scheduling of heterogeneous continuous queries

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
NSJ: an efficient non-blocking spatial join algorithm

GIS '06 Proceedings of the 14th annual ACM international symposium on Advances in geographic information systems
Window join approximation over data streams with importance semantics

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
The effect of reading policy on early join result production

Information Sciences: an International Journal
Streaming queries over streaming data

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
A transducer-based XML query processor

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Scheduling for shared window joins over data streams

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Querying the internet with PIER

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Tuple routing strategies for distributed eddies

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Processing sliding window multi-joins in continuous queries over data streams

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
To share or not to share?

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Algorithms and metrics for processing multiple heterogeneous continuous queries

ACM Transactions on Database Systems (TODS)
Adaptive query processing

Foundations and Trends in Databases
A strategy to develop adaptive and interactive query brokers

IDEAS '08 Proceedings of the 2008 international symposium on Database engineering & applications
Time-completeness trade-offs in record linkage using adaptive query processing

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Exploiting join cardinality for faster hash joins

Proceedings of the 2009 ACM symposium on Applied Computing
RRPJ: result-rate based progressive relational join

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Fault-tolerant query processing in structured P2P-systems

Distributed and Parallel Databases
The declarative imperative: experiences and conjectures in distributed logic

ACM SIGMOD Record
Processing exact results for sliding window joins over data streams using disk storage

International Journal of Intelligent Information and Database Systems
R-MESHJOIN for near-real-time data warehousing

DOLAP '10 Proceedings of the ACM 13th international workshop on Data warehousing and OLAP
A disk-based, adaptive approach to memory-limited computation of windowed stream joins

DEXA'10 Proceedings of the 21st international conference on Database and expert systems applications: Part I
How soccer players would do stream joins

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Querying sliding windows over online data streams

EDBT'04 Proceedings of the 2004 international conference on Current Trends in Database Technology
Resource optimization for processing of stream data in data warehouse environment

Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Progressive high-dimensional similarity join

DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
Towards relaxed selection and join queries over data streams

ADBIS'12 Proceedings of the 16th East European conference on Advances in Databases and Information Systems
Optimised X-HYBRIDJOIN for near-real-time data warehousing

ADC '12 Proceedings of the Twenty-Third Australasian Database Conference - Volume 124
Data-Fu: a language and an interpreter for interaction with read/write linked data

Proceedings of the 22nd international conference on World Wide Web
A generic front-stage for semi-stream processing

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
SkySuite: a framework of skyline-join operators for static and stream environments

Proceedings of the VLDB Endowment

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, the performance of characteristics of the execution of various join-trees on a parallel DBMS are studied. The results of this study, are a step into the direction of the design of a query optimization strategy that is fit for parallel execution of complex queries.Among others, synchronization issues are identified to limit the perfo rmance gain from parallelism. A new hash-join algorithm, called Pipelining hash-join is introduced that has fewer synchronization constraints than the known hash-join algorithms. Also, the behavior of individual join operations in a join-tree is studied in a simulation experiment. The results show that the Pipelining hash-join algorithms yields a better performance for multi-join queries. Also, the format of the optimal join-tree appears to depend on the size of the operands of the join: A multi-join between small operands performs best with a bushy schedule; larger operands are better off with a linear schedule. The results from the simulation study are confirmed with an analytic model for dataflow query execution.