This paper studies provenance information for queries with aggregation. Provenance has previously been studied for various query languages that do not allow aggregation, and recent work has proposed capturing it by annotating database tuples with elements of a commutative semiring and propagating the annotations through query evaluation. We show that aggregate queries pose novel challenges that render this approach inapplicable. Consequently, we propose a new approach in which we annotate with provenance information not just tuples but also the individual values within tuples, using provenance to describe how each value was computed. We realize this approach in a concrete construction, first for "simple" queries where the aggregation operator is the last one applied, and then for arbitrary (positive) relational algebra queries with aggregation; the latter queries are shown to be more challenging in this context. Finally, we use aggregation to encode queries with difference, and study the semantics obtained for such queries on provenance-annotated databases.
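To make the distinction between tuple-level and value-level annotation concrete, the following is a minimal sketch and not the paper's actual construction. It assumes provenance polynomials over tuple tokens as the semiring (addition for alternative derivations, multiplication for joint use) and represents an aggregated SUM as a formal sum of (annotation, value) pairs. The relation `emp`, the tokens `p1`, `p2`, `p3`, `q1`, and all function names are hypothetical and chosen only for illustration.

```python
from collections import Counter
from itertools import product
from math import prod

# A provenance polynomial is a Counter: monomial (sorted tuple of tokens) -> coefficient.
def var(name):
    return Counter({(name,): 1})

def plus(p, q):
    # Alternative derivations (union, projection): add annotations.
    return p + q

def times(p, q):
    # Joint use of tuples (join): multiply annotations.
    out = Counter()
    for (m1, c1), (m2, c2) in product(p.items(), q.items()):
        out[tuple(sorted(m1 + m2))] += c1 * c2
    return out

def evaluate(poly, valuation):
    # Map each token to a multiplicity and evaluate the polynomial.
    return sum(c * prod(valuation[x] for x in mono) for mono, c in poly.items())

# Hypothetical annotated relation emp(name, dept, salary): row -> annotation.
emp = {
    ("ann", "d1", 20): var("p1"),
    ("bob", "d1", 10): var("p2"),
    ("cat", "d2", 15): var("p3"),
}

def sum_by_group(relation, key_index, value_index):
    # Value-level annotation: the aggregate is kept as a formal sum of
    # (annotation, value) pairs instead of being collapsed to one number.
    groups = {}
    for row, annotation in relation.items():
        groups.setdefault(row[key_index], []).append((annotation, row[value_index]))
    return groups

def evaluate_sum(formal_sum, valuation):
    # Each value contributes as many times as its annotation evaluates to.
    return sum(evaluate(a, valuation) * v for a, v in formal_sum)

agg = sum_by_group(emp, key_index=1, value_index=2)
joined = times(var("p1"), var("q1"))  # annotation of a join result

print(evaluate_sum(agg["d1"], {"p1": 1, "p2": 1, "p3": 1}))  # 30
print(evaluate_sum(agg["d1"], {"p1": 1, "p2": 0, "p3": 1}))  # 20 (bob's tuple removed)
print(evaluate_sum(agg["d1"], {"p1": 2, "p2": 1, "p3": 1}))  # 50 (ann's tuple duplicated)
```

Keeping the aggregate symbolic is the point of the sketch: the same annotated value can be re-evaluated under different assumptions about the source tuples (deletion, duplication) without rerunning the query. A tuple-level annotation alone cannot express this, since the aggregated number itself changes with the valuation.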