Automated packaging of bioinformatics workflows for portability and durability using makeflow

  • Authors:
  • Casey Robinson;Douglas Thain

  • Affiliations:
  • University of Notre Dame;University of Notre Dame

  • Venue:
  • WORKS '13 Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Dependency management remains a major challenge for all forms of software. A program implemented in a given environment typically has many implicit dependencies on programs, libraries, and other objects present within that environment. Moving applications between different runtime environments is certain to fail due to the existence of those external dependencies. Workflows particularly suffer from dependency management problems, precisely because they tie together multiple independent programs into a coherent whole. To address the problem of workflow decay, we propose applying the old idea of a "linker" into the new context of workflow systems. We have implemented a linker for the Makeflow workflow system, and extended the concept to apply recursively to executables and scripted languages within the workflow. We evaluate the system by applying it to a selection of bioinformatics workflows including BLAST, BWA, and SHRiMP, enabling them to be moved across multiple computation environments. We also show that the portability provided by packaging allows for improved performance.