Provenance management in Swift

  • Authors:
  • Luiz M. R. Gadelha Jr.;Ben Clifford;Marta Mattoso;Michael Wilde;Ian Foster

  • Affiliations:
  • Computer and Systems Engineering Program, Federal University of Rio de Janeiro, Brazil and National Laboratory for Scientific Computing, Brazil;Computation Institute, University of Chicago, USA;Computer and Systems Engineering Program, Federal University of Rio de Janeiro, Brazil;Computation Institute, University of Chicago, USA and Mathematics and Computer Science Division, Argonne National Laboratory, USA;Computation Institute, University of Chicago, USA and Mathematics and Computer Science Division, Argonne National Laboratory, USA

  • Venue:
  • Future Generation Computer Systems
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The Swift parallel scripting language allows for the specification, execution and analysis of large-scale computations in parallel and distributed environments. It incorporates a data model for recording and querying provenance information. In this article we describe these capabilities and evaluate the interoperability with other systems through the use of the Open Provenance Model. We describe Swift's provenance data model and compare it to the Open Provenance Model. We also describe and evaluate activities performed within the Third Provenance Challenge, which consisted of implementing a specific scientific workflow, capturing and recording provenance information of its execution, performing provenance queries, and exchanging provenance information with other systems. Finally, we propose improvements to both the Open Provenance Model and Swift's provenance system.