Kepler/pPOD: Scientific Workflow and Provenance Support for Assembling the Tree of Life

  • Authors:
  • Shawn Bowers;Timothy Mcphillips;Sean Riddle;Manish Kumar Anand;Bertram Ludäscher

  • Affiliations:
  • UC Davis Genome Center, University of California, Davis,;UC Davis Genome Center, University of California, Davis,;UC Davis Genome Center, University of California, Davis,;Department of Computer Science, University of California, Davis,;UC Davis Genome Center, University of California, Davis, and Department of Computer Science, University of California, Davis,

  • Venue:
  • Provenance and Annotation of Data and Processes
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

The complexity of scientific workflows for analyzing biological data creates a number of challenges for current workflow and provenance systems. This complexity is due in part to the nature of scientific data (e.g., heterogeneous, nested data collections) and the programming constructs required for automation (e.g., nested workflows, looping, pipeline parallelism). We present an extended version of the Kepler scientific workflow system to address these challenges, tailored for the systematics community. Our system combines novel approaches for representing scientific data, modeling and automating complex analyses, and recording and browsing associated provenance information.