On Estimating the Maximum Domination Value and the Skyline Cardinality of Multi-Dimensional Data Sets

  • Authors:
  • Eleftherios Tiakas;Apostolos N. Papadopoulos;Yannis Manolopoulos

  • Affiliations:
  • Department of Informatics, Aristotle University, Thessaloniki, Greece;Department of Informatics, Aristotle University, Thessaloniki, Greece;Department of Informatics, Aristotle University, Thessaloniki, Greece

  • Venue:
  • International Journal of Knowledge-Based Organizations
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

The last years there is an increasing interest for query processing techniques that take into consideration the dominance relationship between items to select the most promising ones, based on user preferences. Skyline and top-k dominating queries are examples of such techniques. A skyline query computes the items that are not dominated, whereas a top-k dominating query returns the k items with the highest domination score. To enable query optimization, it is important to estimate the expected number of skyline items as well as the maximum domination value of an item. In this article, the authors provide an estimation for the maximum domination value under the dinstinct values and attribute independence assumptions. The authors provide three different methodologies for estimating and calculating the maximum domination value and the authors test their performance and accuracy. Among the proposed estimation methods, their method Estimation with Roots outperforms all others and returns the most accurate results. They also introduce the eliminating dimension, i.e., the dimension beyond which all domination values become zero, and the authors provide an efficient estimation of that dimension. Moreover, the authors provide an accurate estimation of the skyline cardinality of a data set.