Static compiler analysis for workflow provenance

  • Authors:
  • Devarshi Ghoshal;Arun Chauhan;Beth Plale

  • Affiliations:
  • Indiana University, Bloomington, IN;Indiana University, Bloomington, IN;Indiana University, Bloomington, IN

  • Venue:
  • WORKS '13 Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data provenance is the lineage of an artifact or object. Provenance can provide a basis upon which data can be regenerated, and can be used to determine the quality of both the process and provenance itself. Provenance capture from workflows is comprised of capturing data dependencies as and when a workflow executes. We propose a layered provenance model which identifies and stores provenance at different granularities statically by analyzing the source code of programs. We use this model to capture provenance from both workflows and modules within workflows. This paper contributes a static compile time analysis methodology that includes a logical layered provenance model to convert workflow provenance from black box to white box, where the precise mapping between the inputs and outputs of a task can be known.