Finding skylines for incomplete data

  • Authors:
  • Rahul Bharuka;P. Sreenivasa Kumar

  • Affiliations:
  • Indian Institute of Technology Madras, Chennai, India;Indian Institute of Technology Madras, Chennai, India

  • Venue:
  • ADC '13 Proceedings of the Twenty-Fourth Australasian Database Conference - Volume 137
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the last decade, skyline queries have been extensively studied for different domains because of their wide applications in multi-criteria decision making and search space pruning. A skyline query returns all the interesting points in a multi-dimensional data set that are not dominated by any other point with respect to all dimensions. However, real world data sets are seldom complete, i.e. data points often have missing values in one or more dimensions. Traditional skyline query processing algorithms developed for complete data can not be easily adapted for such situations because of the non-transitive and potentially cyclic nature of dominance relation that arises in the case of incomplete data. Unfortunately, skyline query processing for such incomplete data has not received enough attention. We propose an efficient Sort-based Incomplete Data Skyline (SIDS) algorithm to compute the skyline points over incomplete data. Extensive experiments on both real world and synthetic data sets demonstrate the efficiency and scalability of our approach over current state of the art approach.