Automatic capture and efficient storage of e-Science experiment provenance

  • Authors:
  • Roger S. Barga; Luciano A. Digiampietri

  • Affiliations:
  • Microsoft Research, One Microsoft Way, Redmond, WA 98052, U.S.A.; Institute of Computing, University of Campinas, Sao Paulo, Brazil

  • Venue:
  • Concurrency and Computation: Practice & Experience - The First Provenance Challenge
  • Year:
  • 2008

Abstract

For the first provenance challenge, we introduce a layered model to represent workflow provenance that allows navigation from an abstract model of the experiment to instance data collected during a specific experiment run. We outline modest extensions to a commercial workflow engine so it will automatically capture provenance at workflow runtime. We also present an approach to store this provenance data in a relational database. Finally, we demonstrate how core provenance queries in the challenge can be expressed in SQL and discuss the merits of our layered representation. Copyright © 2007 John Wiley & Sons, Ltd.
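
The abstract does not give the schema or the queries themselves, so the following is only a minimal sketch of how a layered provenance store might look in a relational database. Every table, column, and function name here (`workflow`, `activity`, `run`, `activity_execution`, `build_example_db`, `trace_run`) is a hypothetical choice for illustration, and SQLite stands in for whatever database the authors used. The upper layer holds the abstract workflow model, the lower layer holds instance data from one run, and a SQL join navigates from one to the other.

```python
import sqlite3

# Hypothetical two-layer schema, NOT the schema from the paper.
# Abstract layer: workflow, activity. Instance layer: run, activity_execution.
SCHEMA = """
CREATE TABLE workflow (
    id   INTEGER PRIMARY KEY,
    name TEXT NOT NULL
);
CREATE TABLE activity (
    id          INTEGER PRIMARY KEY,
    workflow_id INTEGER NOT NULL REFERENCES workflow(id),
    name        TEXT NOT NULL
);
CREATE TABLE run (
    id          INTEGER PRIMARY KEY,
    workflow_id INTEGER NOT NULL REFERENCES workflow(id),
    started_at  TEXT NOT NULL
);
CREATE TABLE activity_execution (
    id           INTEGER PRIMARY KEY,
    run_id       INTEGER NOT NULL REFERENCES run(id),
    activity_id  INTEGER NOT NULL REFERENCES activity(id),
    input_value  TEXT,
    output_value TEXT
);
"""


def build_example_db():
    """Create an in-memory database with one workflow and one recorded run."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO workflow VALUES (1, 'example_workflow')")
    conn.executemany(
        "INSERT INTO activity VALUES (?, ?, ?)",
        [(1, 1, "step_a"), (2, 1, "step_b")],
    )
    conn.execute("INSERT INTO run VALUES (1, 1, '2007-01-01T00:00:00')")
    conn.executemany(
        "INSERT INTO activity_execution VALUES (?, ?, ?, ?, ?)",
        [
            (1, 1, 1, "raw.dat", "mid.dat"),
            (2, 1, 2, "mid.dat", "final.dat"),
        ],
    )
    conn.commit()
    return conn


def trace_run(conn, run_id):
    """Join the abstract layer to the instance layer for a single run."""
    query = """
        SELECT w.name, a.name, e.input_value, e.output_value
        FROM activity_execution AS e
        JOIN activity AS a ON a.id = e.activity_id
        JOIN run      AS r ON r.id = e.run_id
        JOIN workflow AS w ON w.id = r.workflow_id
        WHERE r.id = ?
        ORDER BY e.id
    """
    return conn.execute(query, (run_id,)).fetchall()


if __name__ == "__main__":
    for row in trace_run(build_example_db(), 1):
        print(row)
```

Keeping the abstract model and the per-run records in separate tables means the workflow definition is stored once and each run adds only its own instance rows, which is one plausible reading of the "efficient storage" in the title.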