Distributed Query Processing in the Internet: Exploring Relation Replication and Network Characteristics

Authors:
Affiliations:
Venue:
ICDCS '01 Proceedings of the The 21st International Conference on Distributed Computing Systems
Year:
2001

Citing 14
Cited 1

Distributed query processing

ACM Computing Surveys (CSUR)
Introduction to algorithms

Introduction to algorithms
Principles of database query processing for advanced applications

Principles of database query processing for advanced applications
The state of the art in distributed query processing

ACM Computing Surveys (CSUR)
Considering data skew factor in multi-way join query optimization for parallel execution

The VLDB Journal — The International Journal on Very Large Data Bases - Parallelism in database systems
A Graph Theoretical Approach to Determine a Join Reducer Sequence in Distributed Query Processing

IEEE Transactions on Knowledge and Data Engineering
Optimal Allocation for Partially Replicated Database Systems on Ring Networks

IEEE Transactions on Knowledge and Data Engineering
Combinatorial Optimization of Distributed Queries

IEEE Transactions on Knowledge and Data Engineering
Optimization of Parallel Execution for Multi-Join Queries

IEEE Transactions on Knowledge and Data Engineering
On the Complexity of Distributed Query Optimization

IEEE Transactions on Knowledge and Data Engineering
A Parallel Sort Merge Join Algorithm for Managing Data Skew

IEEE Transactions on Parallel and Distributed Systems
Combining Joint and Semi-Join Operations for Distributed Query Processing

IEEE Transactions on Knowledge and Data Engineering
Join Index Hierarchies for Supporting Efficient Navigations in Object-Oriented Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
A Scheme for High-Performance Data Delivery in the Web Environment

ICPADS '98 Proceedings of the 1998 International Conference on Parallel and Distributed Systems

Efficient data distribution strategy for join query processing in the cloud

Proceedings of the third international workshop on Cloud data management

Quantified Score

Hi-index	0.00

Visualization

Abstract

Abstract: We introduce the concept of network graph for distributed query processing. Semijoins and joins are termed contributive replicated semijoins and contributed replicated joins, respectively, when they are interleaved into a join sequence to reduce the amount of data transmission cost required in a network with replicated relations. Our solution procedure consists of three consecutive steps, namely relation selection, join sequence scheduling and merge processing. A simulator is developed to evaluate the performance of algorithms devised. Our results show that the approach of interleaving a join sequence with contributive replicated semijoins/joins is not only efficient in its execution but also effective in reducing the total amount of data transmission cost required to process distributed queries.