Conditioning probabilistic databases

Authors:
Christoph Koch;Dan Olteanu
Affiliations:
Cornell University, Ithaca, NY;Oxford University, Oxford, UK
Venue:
Proceedings of the VLDB Endowment
Year:
2008

Citing 20
Cited 33

Incomplete Information in Relational Databases

Journal of the ACM (JACM)
Graph-Based Algorithms for Boolean Function Manipulation

IEEE Transactions on Computers
Monte-Carlo approximation algorithms for enumeration problems

Journal of Algorithms
On the representation and querying of sets of possible worlds

Selected papers of the workshop on Deductive database theory
A probabilistic relational algebra for the integration of information retrieval and database systems

ACM Transactions on Information Systems (TOIS)
The complexity of query reliability

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
A Computing Procedure for Quantification Theory

Journal of the ACM (JACM)
An Optimal Algorithm for Monte Carlo Estimation

SIAM Journal on Computing
Approximation algorithms

Approximation algorithms
Algorithms and Data Structures in VLSI Design

Algorithms and Data Structures in VLSI Design
Clean Answers over Dirty Databases: A Probabilistic Approach

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
ULDBs: databases with uncertainty and lineage

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Management of probabilistic data: foundations and challenges

Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Efficient query evaluation on probabilistic databases

The VLDB Journal — The International Journal on Very Large Data Bases
Approximating predicates and expressive queries on probabilistic databases

Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Monte-Carlo algorithms for enumeration and reliability problems

SFCS '83 Proceedings of the 24th Annual Symposium on Foundations of Computer Science
Fast and Simple Relational Processing of Uncertain Data

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Exploiting Lineage for Confidence Computation in Uncertain and Probabilistic Databases

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
A knowledge compilation map

Journal of Artificial Intelligence Research
The good old Davis-Putnam procedure helps counting models

Journal of Artificial Intelligence Research

Managing Probabilistic Data with MystiQ: The Can-Do, the Could-Do, and the Can't-Do

SUM '08 Proceedings of the 2nd international conference on Scalable Uncertainty Management
Using OBDDs for Efficient Query Evaluation on Probabilistic Databases

SUM '08 Proceedings of the 2nd international conference on Scalable Uncertainty Management
A compositional query algebra for second-order logic and uncertain databases

Proceedings of the 12th International Conference on Database Theory
A compositional framework for complex queries over uncertain data

Proceedings of the 12th International Conference on Database Theory
Evaluating probability threshold k-nearest-neighbor queries over uncertain data

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
On Query Algebras for Probabilistic Databases

ACM SIGMOD Record
Ranking distributed probabilistic data

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Secondary-storage confidence computation for conjunctive queries with inequalities

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Indexing correlated probabilistic databases

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
MayBMS: a probabilistic database management system

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
The trichotomy of HAVING queries on a probabilistic database

The VLDB Journal — The International Journal on Very Large Data Bases
$${10^{(10^{6})}}$$ worlds and beyond: efficient representation and processing of incomplete information

The VLDB Journal — The International Journal on Very Large Data Bases
A unified approach to ranking in probabilistic databases

Proceedings of the VLDB Endowment
Bridging the gap between intensional and extensional query evaluation in probabilistic databases

Proceedings of the 13th International Conference on Extending Database Technology
Understanding cardinality estimation using entropy maximization

Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
GRN model of probabilistic databases: construction, transition and querying

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Skyline query processing for uncertain data

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
MCDB-R: risk analysis in the database

Proceedings of the VLDB Endowment
A unified approach to ranking in probabilistic databases

The VLDB Journal — The International Journal on Very Large Data Bases
Provenance for aggregate queries

Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Querying uncertain data with aggregate constraints

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
The monte carlo database system: Stochastic analysis close to the data

ACM Transactions on Database Systems (TODS)
A truly dynamic data structure for top-k queries on uncertain data

SSDBM'11 Proceedings of the 23rd international conference on Scientific and statistical database management
Understanding cardinality estimation using entropy maximization

ACM Transactions on Database Systems (TODS)
Design by example for SQL table definitions with functional dependencies

The VLDB Journal — The International Journal on Very Large Data Bases
Aggregation in probabilistic databases via knowledge compilation

Proceedings of the VLDB Endowment
MUD: Mapping-based query processing for high-dimensional uncertain data

Information Sciences: an International Journal
Probabilistic databases with MarkoViews

Proceedings of the VLDB Endowment
Human-machine cooperation with epistemological DBs: supporting user corrections to knowledge bases

AKBC-WEKEX '12 Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction
Mining frequent subgraphs over uncertain graph databases under probabilistic semantics

The VLDB Journal — The International Journal on Very Large Data Bases
A Regression Dependent Iterative Algorithm for Optimizing Top-K Selection in Simulation Query Language

International Journal of Decision Support System Technology
A temporal-probabilistic database model for information extraction

Proceedings of the VLDB Endowment
Anytime approximation in probabilistic databases

The VLDB Journal — The International Journal on Very Large Data Bases

Quantified Score

Hi-index	0.00

Visualization

Abstract

Past research on probabilistic databases has studied the problem of answering queries on a static database. Application scenarios of probabilistic databases however often involve the conditioning of a database using additional information in the form of new evidence. The conditioning problem is thus to transform a probabilistic database of priors into a posterior probabilistic database which is materialized for subsequent query processing or further refinement. It turns out that the conditioning problem is closely related to the problem of computing exact tuple confidence values. It is known that exact confidence computation is an NP-hard problem. This has led researchers to consider approximation techniques for confidence computation. However, neither conditioning nor exact confidence computation can be solved using such techniques. In this paper we present efficient techniques for both problems. We study several problem decomposition methods and heuristics that are based on the most successful search techniques from constraint satisfaction, such as the Davis-Putnam algorithm. We complement this with a thorough experimental evaluation of the algorithms proposed. Our experiments show that our exact algorithms scale well to realistic database sizes and can in some scenarios compete with the most efficient previous approximation algorithms.