Reconciling provenance policy conflicts by inventing anonymous nodes

Authors:
Saumen Dey;Daniel Zinn;Bertram Ludäscher
Affiliations:
Dept. of Computer Science, University of California, Davis;Genome Center, University of California, Davis;Dept. of Computer Science, University of California, Davis
Venue:
ESWC'11 Proceedings of the 8th international conference on The Semantic Web
Year:
2011

Abstract

In scientific collaborations, provenance is increasingly used to understand, debug, and explain the processing history of data, and to determine the validity and quality of data products. While provenance is easily recorded by scientific workflow systems, it can be infeasible or undesirable to publish provenance details for all data products of a workflow run. We have developed ProPub, a system that allows users to publish a customized version of their data provenance, based on a set of publication and customization requests, while observing certain provenance publication policies, expressed as logic integrity constraints. When user requests conflict with provenance policies, repair actions become necessary. In prior work, we removed additional parts of the provenance graph (i.e., not directly requested by the user) to repair constraint violations. In this paper, we present an alternative approach, which ensures that all relevant nodes are retained in the provenance graph. The key idea is to introduce new anonymous nodes to represent lineage dependencies, without revealing information that the user wants to protect. With this new approach, a user may now explore different provenance publication strategies, and choose the most appropriate one before publishing sensitive provenance data.
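
The key idea in the abstract, replacing a protected node with an anonymous one so that lineage dependencies survive, can be illustrated with a small sketch. This is not ProPub's repair algorithm: the edge-list representation, the function names `anonymize` and `ancestors`, the `_anon` labels, and the sample graph are all assumptions made for illustration only. Edges are written in data-flow direction, so `(x, y)` means `y` depends on `x`.

```python
from itertools import count


def anonymize(edges, hidden):
    """Return a copy of `edges` where every node in `hidden` is replaced
    by a fresh anonymous label (hypothetical naming scheme: _anon1, ...)."""
    fresh = count(1)
    alias = {}

    def rename(node):
        if node not in hidden:
            return node
        if node not in alias:
            alias[node] = f"_anon{next(fresh)}"
        return alias[node]

    return [(rename(src), rename(dst)) for src, dst in edges]


def ancestors(edges, node):
    """Return all nodes that `node` transitively depends on."""
    parents = {}
    for src, dst in edges:
        parents.setdefault(dst, set()).add(src)
    seen, stack = set(), [node]
    while stack:
        for parent in parents.get(stack.pop(), ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


# Illustrative run: data d1 -> actor a1 -> data d2 -> actor a2 -> data d3.
edges = [("d1", "a1"), ("a1", "d2"), ("d2", "a2"), ("a2", "d3")]
hidden = {"d2"}  # the node the user wants to protect

# Repair by removal: drop every edge touching the hidden node.
removed = [(s, d) for s, d in edges if s not in hidden and d not in hidden]

# Repair by anonymization: keep the edges, hide only the node's identity.
published = anonymize(edges, hidden)

print(ancestors(removed, "d3"))    # lineage back to d1 is lost
print(ancestors(published, "d3"))  # lineage back to d1 is retained
```

In this toy graph, removal cuts `d3` off from `d1`, whereas the anonymized version keeps the dependency chain intact while the label `d2` no longer appears in the published edges.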