Database Support for Probabilistic Attributes and Tuples

Authors:
Sarvjeet Singh;Chris Mayfield;Rahul Shah;Sunil Prabhakar;Susanne Hambrusch;Jennifer Neville;Reynold Cheng
Affiliations:
Department of Computer Science, Purdue University, West Lafayette, Indiana, USA. sarvjeet@cs.purdue.edu;Department of Computer Science, Purdue University, West Lafayette, Indiana, USA. cmayfiel@cs.purdue.edu;Department of Computer Science, Louisiana State University, Baton Rouge, Louisiana, USA. rahul@csc.lsu.edu;Department of Computer Science, Purdue University, West Lafayette, Indiana, USA. sunil@cs.purdue.edu;Department of Computer Science, Purdue University, West Lafayette, Indiana, USA. seh@cs.purdue.edu;Department of Computer Science, Purdue University, West Lafayette, Indiana, USA. neville@cs.purdue.edu;Department of Computing, Hong Kong Polytechnic University, Kowloon, Hong Kong, China. csckcheng@comp.polyu.edu.hk
Venue:
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Year:
2008

Cited by (35):

Orion 2.0: native support for uncertain data
  Proceedings of the 2008 ACM SIGMOD international conference on Management of data

Exploiting shared correlations in probabilistic databases
  Proceedings of the VLDB Endowment

Top-k dominating queries in uncertain databases
  Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology

Efficient processing of probabilistic reverse nearest neighbor queries over uncertain data
  The VLDB Journal — The International Journal on Very Large Data Bases

Continuously monitoring top-k uncertain data streams: a probabilistic threshold method
  Distributed and Parallel Databases

$10^{(10^{6})}$ worlds and beyond: efficient representation and processing of incomplete information
  The VLDB Journal — The International Journal on Very Large Data Bases

Efficient join processing on uncertain data streams
  Proceedings of the 18th ACM conference on Information and knowledge management

Reverse skyline search in uncertain databases
  ACM Transactions on Database Systems (TODS)

Probabilistic histograms for probabilistic data
  Proceedings of the VLDB Endowment

Transducing Markov sequences
  Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems

PODS: a new model and processing algorithms for uncertain data streams
  Proceedings of the 2010 ACM SIGMOD International Conference on Management of data

Threshold query optimization for uncertain data
  Proceedings of the 2010 ACM SIGMOD International Conference on Management of data

Querying and cleaning uncertain data
  QuaCon'09 Proceedings of the 1st international conference on Quality of context

Finding the least influenced set in uncertain databases
  Information Systems

Combining intensional with extensional query evaluation in tuple independent probabilistic databases
  Information Sciences: an International Journal

A*-tree: a structure for storage and modeling of uncertain multidimensional arrays
  Proceedings of the VLDB Endowment

Conditioning and aggregating uncertain data streams: going beyond expectations
  Proceedings of the VLDB Endowment

Synopses for probabilistic data over large domains
  Proceedings of the 14th International Conference on Extending Database Technology

Asymptotically efficient algorithms for skyline probabilities of uncertain data
  ACM Transactions on Database Systems (TODS)

Coalescing executions for fast uncertainty analysis
  Proceedings of the 33rd International Conference on Software Engineering

Top-K probabilistic closest pairs query in uncertain spatial databases
  APWeb'11 Proceedings of the 13th Asia-Pacific web conference on Web technologies and applications

Database foundations for scalable RDF processing
  RW'11 Proceedings of the 7th international conference on Reasoning web: semantic technologies for the web of data

Evaluating probabilistic spatial-range closest pairs queries over uncertain objects
  WAIM'11 Proceedings of the 12th international conference on Web-age information management

Interactive reasoning in uncertain RDF knowledge bases
  Proceedings of the 20th ACM international conference on Information and knowledge management

Shooting top-k stars in uncertain databases
  The VLDB Journal — The International Journal on Very Large Data Bases

PLR: a benchmark for probabilistic data stream management systems
  ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part III

White box sampling in uncertain data processing enabled by program analysis
  Proceedings of the ACM international conference on Object oriented programming systems languages and applications

CLARO: modeling and processing uncertain data streams
  The VLDB Journal — The International Journal on Very Large Data Bases

Xtream: a system for continuous querying over uncertain data streams
  SUM'12 Proceedings of the 6th international conference on Scalable Uncertainty Management

Probabilistic top-k dominating queries in uncertain databases
  Information Sciences: an International Journal

Efficient processing of probabilistic group subspace skyline queries in uncertain databases
  Information Systems

Skyline queries in crowd-enabled databases
  Proceedings of the 16th International Conference on Extending Database Technology

Causality and responsibility: probabilistic queries revisited in uncertain databases
  Proceedings of the 22nd ACM international conference on Conference on information & knowledge management

FusionDB: conflict management system for small-science databases
  Proceedings of the 22nd ACM international conference on Conference on information & knowledge management

Supporting user-defined functions on uncertain data
  Proceedings of the VLDB Endowment

Abstract

The inherent uncertainty of data present in numerous applications such as sensor databases, text annotations, and information retrieval motivates the need to handle imprecise data at the database level. Uncertainty can be at the attribute or tuple level and is present in both continuous and discrete data domains. This paper presents a model for handling arbitrary probabilistic uncertain data (both discrete and continuous) natively at the database level. Our approach leads to a natural and efficient representation for probabilistic data. We develop a model that is consistent with possible worlds semantics and closed under basic relational operators. This is the first model that accurately and efficiently handles both continuous and discrete uncertainty. The model is implemented in a real database system (PostgreSQL) and the effectiveness and efficiency of our approach are validated experimentally.
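
As a rough illustration of what "consistent with possible worlds semantics" can mean for discrete attribute and tuple uncertainty, the following Python sketch enumerates the possible worlds of a tiny relation and checks that a selection evaluated directly on the stored distributions gives the same probability as summing over worlds. It is not the paper's model or its PostgreSQL implementation. The relation `readings`, the helper names, the independence of tuples, and the convention that missing probability mass means "tuple absent" are all assumptions made for this example.

```python
from itertools import product

# Hypothetical toy relation: each sensor has a discrete uncertain temperature,
# stored as {value: probability}. In this sketch (an assumption, not the
# paper's definition), probabilities summing to less than 1 leave the
# remaining mass for "the tuple does not exist".
readings = {
    "s1": {20: 0.5, 25: 0.5},
    "s2": {30: 0.6},  # exists with probability 0.6
}

def alternatives(pdf):
    """Return (value, prob) pairs, adding (None, p) for the missing-tuple case."""
    alts = list(pdf.items())
    missing = 1.0 - sum(pdf.values())
    if missing > 1e-12:
        alts.append((None, missing))
    return alts

def possible_worlds(relation):
    """Yield (world, probability) pairs, assuming tuples are independent."""
    keys = list(relation)
    for combo in product(*(alternatives(relation[k]) for k in keys)):
        prob = 1.0
        world = {}
        for key, (value, p) in zip(keys, combo):
            prob *= p
            if value is not None:
                world[key] = value
        yield world, prob

def select_prob_by_worlds(relation, key, predicate):
    """Probability that tuple `key` exists and satisfies `predicate`, via worlds."""
    return sum(p for world, p in possible_worlds(relation)
               if key in world and predicate(world[key]))

def select_prob_direct(relation, key, predicate):
    """Same probability computed directly on the stored distribution."""
    return sum(p for value, p in relation[key].items() if predicate(value))

def hot(temperature):
    return temperature >= 25

worlds = list(possible_worlds(readings))
for key in readings:
    print(key, select_prob_by_worlds(readings, key, hot),
          select_prob_direct(readings, key, hot))
```

Enumerating worlds is exponential in the number of uncertain tuples, so the enumeration here serves only as a reference against which the direct computation can be checked on small inputs.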