Efficient Processing of Top-k Queries in Uncertain Databases with x-Relations

Authors:
Ke Yi;Feifei Li;George Kollios;Divesh Srivastava
Affiliations:
Hongkong University of Science and Technology, Hong Kong;Florida State University, Tallahassee;Boston University, Boston;AT&T Labs-Research, Florham Park
Venue:
IEEE Transactions on Knowledge and Data Engineering
Year:
2008

Citing 0
Cited 24

A dynamic data structure for top-k queries on uncertain data

Theoretical Computer Science
Anytime measures for top-k algorithms on exact and fuzzy data sets

The VLDB Journal — The International Journal on Very Large Data Bases
Top-k queries on uncertain data: on score distribution and typical answers

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Probabilistic Similarity Search for Uncertain Time Series

SSDBM 2009 Proceedings of the 21st International Conference on Scientific and Statistical Database Management
Probabilistic ranking over relations

Proceedings of the 13th International Conference on Extending Database Technology
Threshold-based probabilistic top-k dominating queries

The VLDB Journal — The International Journal on Very Large Data Bases
Efficient processing of exact top-k queries over disk-resident sorted lists

The VLDB Journal — The International Journal on Very Large Data Bases
Metric spaces in data mining: applications to clustering

SIGSPATIAL Special
Efficient fuzzy top-k query processing over uncertain objects

DEXA'10 Proceedings of the 21st international conference on Database and expert systems applications: Part I
Combining intensional with extensional query evaluation in tuple independent probabilistic databases

Information Sciences: an International Journal
k-nearest neighbors in uncertain graphs

Proceedings of the VLDB Endowment
Ranking queries on uncertain data

The VLDB Journal — The International Journal on Very Large Data Bases
Context-sensitive document ranking

Journal of Computer Science and Technology
A hybrid algorithm for finding top-k twig answers in probabilistic XML

DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
On pruning for top-k ranking in uncertain databases

Proceedings of the VLDB Endowment
Continuous inverse ranking queries in uncertain streams

SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Top-k best probability queries on probabilistic data

DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part II
On the semantics of top-k ranking for objects with uncertain data

Computers & Mathematics with Applications
Efficient fuzzy ranking queries in uncertain databases

Applied Intelligence
Efficient processing of top-k twig queries over probabilistic XML data

World Wide Web
Top-K aggregate queries on continuous probabilistic datasets

WAIM'13 Proceedings of the 14th international conference on Web-Age Information Management
Entity resolution for distributed probabilistic data

Distributed and Parallel Databases
Top-k entities query processing on uncertainly fused multi-sensory data

Personal and Ubiquitous Computing
Top-k best probability queries and semantics ranking properties on probabilistic databases

Data & Knowledge Engineering

Quantified Score

Hi-index	0.01

Visualization

Abstract

This work introduces new algorithms for processing top-$k$ queries in uncertain databases, under the generally adopted model of x-relations. An x-relation consists of a number of x-tuples, and each x-tuple randomly instantiates into one tuple from one or more alternatives. Soliman et al.~\cite{soliman07} first introduced the problem of top-$k$ query processing in uncertain databases and proposed various algorithms to answer such queries. Under the x-relation model, our new results significantly improve the state of the art, in terms of both running time and memory usage. In the single-alternative case, our new algorithms are 2 to 3 orders of magnitude faster than the previous algorithms. In the multi-alternative case, the improvement is even more dramatic: while the previous algorithms have exponential complexity in both time and space, our algorithms run in near linear or low polynomial time. Our study covers both types of top-$k$ queries proposed in \cite{soliman07}. We provide both the theoretical analysis and an extensive experimental evaluation to demonstrate the superiority of the new approaches over existing solutions.