Grid workflow software for a high-throughput proteome annotation pipeline

  • Authors:
  • Adam Birnbaum;James Hayes;Wilfred W. Li;Mark A. Miller;Peter W. Arzberger;Phililp E. Bourne;Henri Casanova

  • Affiliations:
  • San Diego Supercomputer Center, La Jolla, CA;San Diego Supercomputer Center, La Jolla, CA;San Diego Supercomputer Center, La Jolla, CA;San Diego Supercomputer Center, La Jolla, CA;University of California, San Diego, La Jolla, CA;San Diego Supercomputer Center, La Jolla, CA;San Diego Supercomputer Center, La Jolla, CA

  • Venue:
  • LSGRID'04 Proceedings of the First international conference on Life Science Grid
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

The goal of the Encyclopedia of Life (EOL) Project is to predict structural information for all proteins, in all organisms. This calculation presents challenges both in terms of the scale of the computational resources required (approximately 1.8 million CPU hours), as well as in data and workflow management. While tools are available that solve some subsets of these problems, it was necessary for us to build software to integrate and manage the overall Grid application execution. In this paper, we present this workflow system, detail its components, and report on the performance of our initial prototype implementation for runs over a large-scale Grid platform during the SC'03 conference.