On the use of abstract workflows to capture scientific process provenance

  • Authors:
  • Paulo Pinheiro Da Silva;Leonardo Salayandia;Nicholas Del Rio;Ann Q. Gates

  • Affiliations:
  • Computer Science Department, University of Texas at El Paso, El Paso, Texas;Computer Science Department, University of Texas at El Paso, El Paso, Texas;Computer Science Department, University of Texas at El Paso, El Paso, Texas;Computer Science Department, University of Texas at El Paso, El Paso, Texas

  • Venue:
  • TAPP'10 Proceedings of the 2nd conference on Theory and practice of provenance
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Capturing provenance about artifacts produced by distributed scientific processes is a challenging task. For example, one approach to facilitate the execution of a scientific process in distributed environments is to break down the process into components and to create workflow specifications to orchestrate the execution of these components. However, capturing provenance in such an environment, even with the guidance of orchestration logic, is difficult because of important details that may be hidden by the component abstractions. In this paper, we show how to use abstract workflows to systematically enhance scientific processes to capture provenance at appropriate levels of detail. Abstract workflows lack the orchestration logic to execute a scientific process, and instead, are intended to document scientific processes as understood by scientists and at the level of detail that is necessary to capture provenance for scientific use. Furthermore, abstract workflows are coupled with a representation of provenance that can encode provenance generated from distributed components. We show how the approach described in this paper has been used for capturing provenance for scientific processes in the Earth science, environmental science, and solar physics domains.