On the expressiveness of implicit provenance in query and update languages

Authors:
Peter Buneman;James Cheney;Stijn Vansummeren
Affiliations:
University of Edinburgh, Scotland, UK;University of Edinburgh, Scotland, UK;Hasselt University and Transnational University of Limburg, Diepenbeek, Belgium
Venue:
ACM Transactions on Database Systems (TODS)
Year:
2008

Citing 25
Cited 23

Converting nested algebra expressions into flat algebra expressions

ACM Transactions on Database Systems (TODS)
Principles of programming with complex objects and collection types

ICDT '92 Selected papers of the fourth international conference on Database theory
Normal forms and conservative extension properties for query languages over collection types

Journal of Computer and System Sciences
On genericity and parametricity (extended abstract)

PODS '96 Proceedings of the fifteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Bounded fixpoints for complex objects

Theoretical Computer Science
Query languages for bags and aggregate functions

Journal of Computer and System Sciences - Special issue on principles of database systems
Tracing the lineage of view data in a warehousing environment

ACM Transactions on Database Systems (TODS)
Simulation of the nested relational algebra by the flat relational algebra, with an application to the complexity of evaluating powerset algebra expressions

Theoretical Computer Science
Foundations of Databases: The Logical Level

Foundations of Databases: The Logical Level
Programming primitives for database languages

POPL '81 Proceedings of the 8th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Supporting Fine-grained Data Lineage in a Database Visualization Environment

ICDE '97 Proceedings of the Thirteenth International Conference on Data Engineering
Towards a Language for the Fully Generic Queries

DBLP-6 Proceedings of the 6th International Workshop on Database Programming Languages
A Polygen Model for Heterogeneous Database Systems: The Source Tagging Perspective

VLDB '90 Proceedings of the 16th International Conference on Very Large Data Bases
Specifying Updates in Biomedical Databases

SSDBM '99 Proceedings of the 11th International Conference on Scientific and Statistical Database Management
Querying nested collections

Querying nested collections
Kleisli, a functional query system

Journal of Functional Programming
Lineage retrieval for scientific data processing: a survey

ACM Computing Surveys (CSUR)
A survey of data provenance in e-science

ACM SIGMOD Record
MONDRIAN: Annotating and Querying Databases through Colors and Blocks

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Provenance management in curated databases

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
ULDBs: databases with uncertainty and lineage

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Provenance semirings

Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Relational completeness of query languages for annotated databases

DBPL'07 Proceedings of the 11th international conference on Database programming languages
Provenance as dependency analysis

DBPL'07 Proceedings of the 11th international conference on Database programming languages
On the expressiveness of implicit provenance in query and update languages

ICDT'07 Proceedings of the 11th international conference on Database Theory

Provenance for nested subqueries

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Provenance in Databases: Why, How, and Where

Foundations and Trends in Databases
Empowering Provenance in Data Integration

ADBIS '09 Proceedings of the 13th East European Conference on Advances in Databases and Information Systems
Provenance: a future history

Proceedings of the 24th ACM SIGPLAN conference companion on Object oriented programming systems languages and applications
A graph model of data and workflow provenance

TAPP'10 Proceedings of the 2nd conference on Theory and practice of provenance
A conceptual model and predicate language for data selection and projection based on provenance

TAPP'10 Proceedings of the 2nd conference on Theory and practice of provenance
The Foundations for Provenance on the Web

Foundations and Trends in Web Science
Is provenance logical?

Proceedings of the 4th International Workshop on Logic in Databases
A quest for beauty and wealth (or, business processes for database researchers)

Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Provenance for aggregate queries

Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
On provenance minimization

Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
PrIMe: A methodology for developing provenance-aware applications

ACM Transactions on Software Engineering and Methodology (TOSEM)
Putting lipstick on pig: enabling database-style workflow provenance

Proceedings of the VLDB Endowment
Provenance as dependency analysis

Mathematical Structures in Computer Science - Programming Language Interference and Dependence
Mob data sourcing

SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
A core calculus for provenance

POST'12 Proceedings of the First international conference on Principles of Security and Trust
Functional programs that explain their work

Proceedings of the 17th ACM SIGPLAN international conference on Functional programming
On Provenance Minimization

ACM Transactions on Database Systems (TODS)
Tracing where and who provenance in Linked Data: A calculus

Theoretical Computer Science
Algebraic structures for capturing the provenance of SPARQL queries

Proceedings of the 16th International Conference on Database Theory
WebLab PROV: computing fine-grained provenance links for XML artifacts

Proceedings of the Joint EDBT/ICDT 2013 Workshops
The providence of provenance

BNCOD'13 Proceedings of the 29th British National conference on Big Data
A core calculus for provenance

Journal of Computer Security - Security and Trust Principles

Quantified Score

Hi-index	0.00

Visualization

Abstract

Information describing the origin of data, generally referred to as provenance, is important in scientific and curated databases where it is the basis for the trust one puts in their contents. Since such databases are constructed using operations of both query and update languages, it is of paramount importance to describe the effect of these languages on provenance. In this article we study provenance for query and update languages that are closely related to SQL, and compare two ways in which they can manipulate provenance so that elements of the input are rearranged to elements of the output: implicit provenance, where a query or update only provides the rearranged output, and provenance is provided implicitly by a default provenance semantics; and explicit provenance, where a query or update provides both the output and the description of the provenance of each component of the output. Although explicit provenance is in general more expressive, we show that the classes of implicit provenance operations expressible by query and update languages correspond to natural semantic subclasses of the explicit provenance queries. One of the consequences of this study is that provenance separates the expressive power of query and update languages. The model is also relevant to annotation propagation schemes in which annotations on the input to a query or update have to be transferred to the output or vice versa.