Continuous monitoring of skylines over uncertain data streams

Authors:
Xiaofeng Ding;Xiang Lian;Lei Chen;Hai Jin
Affiliations:
Services Computing Tech. & Sys. Lab, Cluster and Grid Computing Lab, School of Computer Science, Huazhong University of Sci. & Tech., 1037 Luoyu Road, Wuhan, Hubei, China;Department of Computer Science, HongKong University of Sci. & Tech., Clear Water Bay, Kowloon, Hong Kong, China;Department of Computer Science, HongKong University of Sci. & Tech., Clear Water Bay, Kowloon, Hong Kong, China;Services Computing Tech. & Sys. Lab, Cluster and Grid Computing Lab, School of Computer Science, Huazhong University of Sci. & Tech., 1037 Luoyu Road, Wuhan, Hubei, China
Venue:
Information Sciences: an International Journal
Year:
2012

Abstract

Uncertain data are inevitable in many applications because of factors such as the limitations of measuring equipment and delays in data updates. Although modeling and querying uncertain data have recently attracted considerable attention from the database community, many critical issues in conducting advanced analysis on uncertain data remain unresolved. In this paper, we study the execution of the probabilistic skyline query over uncertain data streams. We propose a novel sliding window skyline model in which an uncertain tuple has some probability of being in the skyline at a given timestamp t. Formally, Wp-Skyline(p,t) contains all tuples whose probability of being in the skyline at timestamp t is at least p. In a stream environment, however, computing a probabilistic skyline over the large number of uncertain tuples within the sliding window is a daunting task. To calculate Wp-Skyline efficiently, we propose the candidate list approach, which maintains lists of candidates that might become skyline tuples in future sliding windows. We also propose algorithms that continuously monitor newly arriving and expired data to maintain the skyline candidate set incrementally. To further reduce the cost of deciding whether a candidate tuple belongs to the skyline, we propose an enhanced refinement strategy based on a multi-dimensional indexing structure combined with a grouping-and-conquer strategy. To validate the effectiveness of the proposed approach, we conduct extensive experiments on both real and synthetic data sets and compare it with basic techniques.
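
To make the Wp-Skyline(p,t) definition concrete, the following is a minimal brute-force sketch in Python. It is not the candidate list approach described above. It relies on assumptions the abstract does not state: each tuple carries an independent existence probability, smaller attribute values are preferred, and the sliding window is count-based. Under those assumptions, a tuple's skyline probability is its own existence probability multiplied by the probability that no dominating tuple in the window exists. The names `UncertainTuple`, `skyline_probability`, `wp_skyline`, and `monitor` are hypothetical.

```python
from collections import deque
from typing import Deque, Iterable, List, NamedTuple, Tuple


class UncertainTuple(NamedTuple):
    values: Tuple[float, ...]  # attribute values; smaller is assumed to be better
    prob: float                # assumed independent existence probability in (0, 1]


def dominates(a: Tuple[float, ...], b: Tuple[float, ...]) -> bool:
    """a dominates b if a is no worse in every dimension and better in at least one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def skyline_probability(t: UncertainTuple, window: Iterable[UncertainTuple]) -> float:
    """Probability that t exists and that no tuple dominating t exists (assumed model)."""
    p = t.prob
    for other in window:
        if dominates(other.values, t.values):
            p *= 1.0 - other.prob
    return p


def wp_skyline(window: Iterable[UncertainTuple], p: float) -> List[UncertainTuple]:
    """All tuples in the current window whose skyline probability is at least p."""
    items = list(window)
    return [t for t in items if skyline_probability(t, items) >= p]


def monitor(stream: Iterable[UncertainTuple], window_size: int, p: float):
    """Recompute the thresholded skyline from scratch at every timestamp."""
    window: Deque[UncertainTuple] = deque(maxlen=window_size)  # oldest tuple expires
    results = []
    for t in stream:
        window.append(t)
        results.append(wp_skyline(window, p))
    return results
```

Recomputing from scratch costs time quadratic in the window size at every timestamp, which is the cost that incremental candidate maintenance is meant to avoid.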