Parallel Distributed Processing of Constrained Skyline Queries by Filtering

  • Authors:
  • Bin Cui;Hua Lu;Quanqing Xu;Lijiang Chen;Yafei Dai;Yongluan Zhou

  • Affiliations:
  • Department of Computer Science, Peking University, China. bin.cui@pku.edu.cn;Department of Computer Science, Aalborg University, Denmark. luhua@cs.aau.dk;Department of Computer Science, Peking University, China. xqq@pku.edu.cn;Department of Computer Science, Peking University, China. clj@pku.edu.cn;Department of Computer Science, Peking University, China. dyf@pku.edu.cn;Distributed Information Systems Laboratory, EPFL, Switzerland. yongluan.zhou@epfl.ch

  • Venue:
  • ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Skyline queries are capable of retrieving interesting points from a large data set according to multiple criteria. Most work on skyline queries so far has assumed a centralized storage, whereas in practice relevant data are often distributed among geographically scattered sites. In this work, we tackle constrained skyline queries in large-scale distributed environments without the assumption of any overlay structures, and propose a novel algorithm named PaDSkyline (Parallel Distributed Skyline query processing). PaDSkyline significantly shortens the response time by performing parallel processing over site groups produced by a partition algorithm. Within each group, it locally optimizes the query processing over distributed sites. It also drastically enhances the network transmission efficiency by performing early reduction of skyline candidates with deliberately selected multiple filtering points. Results of extensive experiments demonstrate the efficiency and robustness of our proposals.