Mining thick skylines over large databases

Authors:
Wen Jin;Jiawei Han;Martin Ester
Affiliations:
School of Computing Science, Simon Fraser University;Department of Computer Science, Univ. of Illinois at Urbana-Champaign;School of Computing Science, Simon Fraser University
Venue:
PKDD '04 Proceedings of the 8th European Conference on Principles and Practice of Knowledge Discovery in Databases
Year:
2004

Cited by (17):

Efficient computation of the skyline cube
VLDB '05 Proceedings of the 31st international conference on Very large data bases

Refreshing the sky: the compressed skycube with efficient support for frequent updates
Proceedings of the 2006 ACM SIGMOD international conference on Management of data

Algorithms and analyses for maximal vector computation
The VLDB Journal — The International Journal on Very Large Data Bases

Processing relaxed skylines in PDMS using distributed data summaries
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management

Continuous k-dominant skyline computation on multidimensional data streams
Proceedings of the 2008 ACM symposium on Applied computing

Tuning the Cardinality of Skyline
Advanced Web and Network Technologies, and Applications

Workload-Driven Compressed Skycube Queries in Wireless Applications
WASA '09 Proceedings of the 4th International Conference on Wireless Algorithms, Systems, and Applications

Location-aware privacy and more: a systems approach using context-aware database management systems
Proceedings of the 2nd SIGSPATIAL ACM GIS 2009 International Workshop on Security and Privacy in GIS and LBS

Progressive subspace skyline clusters mining on high dimensional data
PAKDD'07 Proceedings of the 2007 international conference on Emerging technologies in knowledge discovery and data mining

Efficient processing of ranked queries with sweeping selection
PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases

Online subspace skyline query processing using the compressed skycube
ACM Transactions on Database Systems (TODS)

Discovering the most potential stars in social networks with infra-skyline queries
APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications

On optimality-ratio and coverage in ranking of joined search results
Distributed and Parallel Databases

Domination mining and querying
DaWaK'07 Proceedings of the 9th international conference on Data Warehousing and Knowledge Discovery

Efficient computation of combinatorial skyline queries
Information Systems

From stars to galaxies: skyline queries on aggregate data
Proceedings of the 16th International Conference on Extending Database Technology

SkyView: a user evaluation of the skyline operator
Proceedings of the 22nd ACM international conference on information & knowledge management

Abstract

The skyline operator [3] has recently attracted interest: it returns the objects that are not dominated by any other object with regard to certain measures in a multi-dimensional space. Recent work on the skyline operator [3,15,8,13,2] focuses on efficient computation of skylines in large databases. However, such work gives users only thin skylines, i.e., single objects, which may not be desirable in some real applications. In this paper, we propose a novel concept, called the thick skyline, which recommends not only skyline objects but also their nearby neighbors within ε-distance. We develop efficient computation methods, including (1) two efficient algorithms, Sampling-and-Pruning and Indexing-and-Estimating, which find the thick skyline with the help of statistics or indexes in large databases, and (2) a highly efficient Microcluster-based algorithm for mining the thick skyline. The Microcluster-based method not only leads to substantial savings in computation but also provides a concise representation of the thick skyline in the case of high cardinalities. Our experimental performance study shows that the proposed methods are both efficient and effective.
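
To make the definitions in the abstract concrete, the following is a minimal brute-force sketch of a skyline and a thick skyline. It is not any of the paper's algorithms (Sampling-and-Pruning, Indexing-and-Estimating, or the Microcluster-based method), whose details the abstract does not give. It assumes that smaller values are better in every dimension, that distance is Euclidean, and that the distance threshold ε is passed as `eps`; the function names and sample data are hypothetical.

```python
from math import dist  # Euclidean distance, available in Python 3.8+


def dominates(p, q):
    """Return True if p dominates q.

    Assumption: smaller is better in every dimension. p dominates q when
    p is no worse than q everywhere and strictly better somewhere.
    """
    no_worse = all(a <= b for a, b in zip(p, q))
    strictly_better = any(a < b for a, b in zip(p, q))
    return no_worse and strictly_better


def skyline(points):
    """Return the points not dominated by any other point (quadratic scan)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def thick_skyline(points, eps):
    """Return skyline points plus every point within eps of a skyline point.

    Assumption: "nearby neighbors within eps-distance" is read as
    Euclidean distance <= eps to at least one skyline point.
    """
    sky = skyline(points)
    return [p for p in points if any(dist(p, s) <= eps for s in sky)]


# Hypothetical sample data: (price, distance) pairs, lower is better.
points = [(1, 5), (2, 2), (5, 1), (2.5, 2.5), (6, 6)]
thin = skyline(points)
thick = thick_skyline(points, eps=1.0)
```

On this sample, `(2.5, 2.5)` is dominated by `(2, 2)` and so is absent from the thin skyline, but it lies within distance 1.0 of `(2, 2)` and is therefore recommended by the thick skyline. The quadratic scan is only practical for small inputs, which is the gap the paper's methods are meant to close.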