Extension of the one-shot semijoin strategy to minimize data transmission cost in distributed query processing

  • Authors: Faïza Najjar; Yahya Slimani

  • Venue: Information Sciences: an International Journal
  • Year: 1999

Abstract

The optimization of general queries in a distributed database management system is an important and challenging research issue. The problem is to find an optimal evaluation strategy for a given general query. In this paper, we propose an approach that combines join and parallel semijoin operations to minimize the amount of data transmitted in distributed query processing. First, we describe an efficient distributed query processing strategy that uses only semijoins. Instead of a sequence of semijoins, this strategy selects an optimal set of semijoins to be executed in parallel in three phases: a projection phase, a transmission phase, and a reduction phase. Then, we apply a sequence of joins as reducers for query processing. Furthermore, we consider the problem of finding an optimal general sequence that fully reduces a general join query graph, and we present a new method that combines parallel and sequential semijoins. We report on experiments showing that our approach based on parallel semijoins is both efficient and effective in reducing the total amount of data transmission required to process distributed queries.
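
To make the three phases named in the abstract concrete, here is a minimal Python sketch of a single semijoin between two sites. It is an illustration only and not the authors' algorithm: the relations `suppliers` and `orders`, the join attribute `sno`, the helper names, and the cost measure (number of distinct values shipped) are all assumptions introduced for this example.

```python
from typing import Dict, Hashable, List, Set

Row = Dict[str, Hashable]


def project(relation: List[Row], attr: str) -> Set[Hashable]:
    """Projection phase: distinct join-attribute values at the sending site."""
    return {row[attr] for row in relation}


def reduce_by(target: List[Row], attr: str, shipped: Set[Hashable]) -> List[Row]:
    """Reduction phase: keep only target rows whose join value was shipped."""
    return [row for row in target if row[attr] in shipped]


# Hypothetical relations, assumed to live at two different sites.
suppliers = [
    {"sno": 1, "city": "Tunis"},
    {"sno": 2, "city": "Sfax"},
    {"sno": 3, "city": "Sousse"},
]
orders = [
    {"ono": 10, "sno": 1},
    {"ono": 11, "sno": 1},
    {"ono": 12, "sno": 3},
]

# Projection phase, performed at the orders site.
shipped = project(orders, "sno")

# Transmission phase: assumed cost model, one unit per distinct value sent.
transmission_cost = len(shipped)

# Reduction phase, performed at the suppliers site.
reduced_suppliers = reduce_by(suppliers, "sno", shipped)
```

In this sketch only the distinct `sno` values cross the network, and the supplier with no matching order is dropped before any join runs. In a one-shot strategy of the kind the abstract describes, several such semijoins would be chosen as a set and run in parallel rather than one after another.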