Aggregate operators in probabilistic databases

Authors:
Robert Ross;V. S. Subrahmanian;John Grant
Affiliations:
University of Maryland, College Park, MD;University of Maryland, College Park, MD;Towson University, Towson, MD
Venue:
Journal of the ACM (JACM)
Year:
2005

Citing 27
Cited 26

Quantitative deduction and its fixpoint theory

Journal of Logic Programming
On the semantics of rule-based expert systems with uncertainty

Lecture notes in computer science on ICDT '88
Introduction to algorithms

Introduction to algorithms
A logic for reasoning about probabilities

Information and Computation - Selections from 1988 IEEE symposium on logic in computer science
Bilattices and the semantics of logic programming

Journal of Logic Programming
A theory of nonmonotonic inheritance based on annotated logic

Artificial Intelligence
Probabilistic logic programming

Information and Computation
Probabilistic information retrieval as a combination of abstraction, inductive learning, and probabilistic assumptions

ACM Transactions on Information Systems (TOIS)
Stable semantics for probabilistic deductive databases

Information and Computation
Probabilistic deductive databases

ILPS '94 Proceedings of the 1994 International Symposium on Logic programming
A probabilistic relational model and algebra

ACM Transactions on Database Systems (TODS)
ProbView: a flexible probabilistic database system

ACM Transactions on Database Systems (TODS)
Supporting valid-time indeterminacy

ACM Transactions on Database Systems (TODS)
Probabilistic temporal databases, I: algebra

ACM Transactions on Database Systems (TODS)
Discovering outlier filtering rules from unlabeled data: combining a supervised learner with an unsupervised learner

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Information Retrieval

Information Retrieval
Database System Concepts

Database System Concepts
Computers and Intractability: A Guide to the Theory of NP-Completeness

Computers and Intractability: A Guide to the Theory of NP-Completeness
The Management of Probabilistic Data

IEEE Transactions on Knowledge and Data Engineering
A Parametric Approach to Deductive Databases with Uncertainty

IEEE Transactions on Knowledge and Data Engineering
Database Support for Problematic Knowledge

EDBT '92 Proceedings of the 3rd International Conference on Extending Database Technology: Advances in Database Technology
The Theory of Probabilistic Databases

VLDB '87 Proceedings of the 13th International Conference on Very Large Data Bases
Probabilistic Aggregates

ISMIS '02 Proceedings of the 13th International Symposium on Foundations of Intelligent Systems
Modeling Uncertainty in Deductive Databases

DEXA '94 Proceedings of the 5th International Conference on Database and Expert Systems Applications
Action Recognition Using Probabilistic Parsing

CVPR '98 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Evaluating probabilistic queries over imprecise data

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Stereo Depth Estimation: A Confidence Interval Approach

ICCV '98 Proceedings of the Sixth International Conference on Computer Vision

MYSTIQ: a system for finding more answers by using probabilities

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
OLAP over uncertain and imprecise data

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Creating probabilistic databases from information extraction models

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
OLAP over uncertain and imprecise data

The VLDB Journal — The International Journal on Very Large Data Bases
Energy and quality aware query processing in wireless sensor database systems

Information Sciences: an International Journal
Management of probabilistic data: foundations and challenges

Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Efficient aggregation algorithms for probabilistic data

SODA '07 Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms
Efficient query evaluation on probabilistic databases

The VLDB Journal — The International Journal on Very Large Data Bases
Probabilistic ranked queries in uncertain databases

EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Probabilistic top-k and ranking-aggregate queries

ACM Transactions on Database Systems (TODS)
Aggregates in Generalized Temporally Indeterminate Databases

SUM '07 Proceedings of the 1st international conference on Scalable Uncertainty Management
Estimating and bounding aggregations in databases with referential integrity errors

Proceedings of the ACM 11th international workshop on Data warehousing and OLAP
Information Extraction

Foundations and Trends in Databases
Maintaining consistency of vague databases using data dependencies

Data & Knowledge Engineering
The trichotomy of HAVING queries on a probabilistic database

The VLDB Journal — The International Journal on Very Large Data Bases
Extended aggregations for databases with referential integrity issues

Data & Knowledge Engineering
Efficient evaluation of HAVING queries on a probabilistic database

DBPL'07 Proceedings of the 11th international conference on Database programming languages
Computing a k-route over uncertain geographical data

SSTD'07 Proceedings of the 10th international conference on Advances in spatial and temporal databases
Handling inconsistency of vague relations with functional dependencies

ER'07 Proceedings of the 26th international conference on Conceptual modeling
A hybrid object based model combining probability and fuzzy set theories

International Journal of Intelligent Information and Database Systems
Efficiently computing and querying multidimensional OLAP data cubes over probabilistic relational data

ADBIS'10 Proceedings of the 14th east European conference on Advances in databases and information systems
Specifying aggregation functions in multidimensional models with OCL

ER'10 Proceedings of the 29th international conference on Conceptual modeling
Continuous probabilistic count queries in wireless sensor networks

SSTD'11 Proceedings of the 12th international conference on Advances in spatial and temporal databases
Probabilistic query answering over inconsistent databases

Annals of Mathematics and Artificial Intelligence
AN EFFICIENT REPRESENTATION MODEL OF DISTANCE DISTRIBUTION BETWEEN UNCERTAIN OBJECTS

Computational Intelligence
Top-k best probability queries and semantics ranking properties on probabilistic databases

Data & Knowledge Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Though extensions to the relational data model have been proposed in order to handle probabilistic information, there has been very little work to date on handling aggregate operators in such databases. In this article, we present a very general notion of an aggregate operator and show how classical aggregation operators (such as COUNT, SUM, etc.) as well as statistical operators (such as percentiles, variance, etc.) are special cases of this general definition. We devise a formal linear programming based semantics for computing aggregates over probabilistic DBMSs, develop algorithms that satisfy this semantics, analyze their complexity, and introduce several families of approximation algorithms that run in polynomial time. We implemented all of these algorithms and tested them on a large set of data to help determine when each one is preferable.