Adaptive workflow processing and execution in Pegasus

  • Authors:
  • Kevin Lee;Norman W. Paton;Rizos Sakellariou;Ewa Deelman;Alvaro A. A. Fernandes;Gaurang Mehta

  • Affiliations:
  • School of Computer Science, University of Manchester, Oxford Road, Manchester M13 9PL, U.K.;School of Computer Science, University of Manchester, Oxford Road, Manchester M13 9PL, U.K.;School of Computer Science, University of Manchester, Oxford Road, Manchester M13 9PL, U.K.;University of Southern California, Information Sciences Institute, 4676 Admiralty Way, Marina Del Ray, CA 90292, U.S.A.;School of Computer Science, University of Manchester, Oxford Road, Manchester M13 9PL, U.K.;University of Southern California, Information Sciences Institute, 4676 Admiralty Way, Marina Del Ray, CA 90292, U.S.A.

  • Venue:
  • Concurrency and Computation: Practice & Experience - Special Issue: 3rd International Workshop on Workflow Management and Applications in Grid Environments (WaGe2008)
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Workflows are widely used in applications that require coordinated use of computational resources. Workflow definition languages typically abstract over some aspects of the way in which a workflow is to be executed, such as the level of parallelism to be used or the physical resources to be deployed. As a result, a workflow management system has the responsibility of establishing how best to execute a workflow given the available resources. The Pegasus workflow management system compiles abstract workflows into concrete execution plans, and has been widely used in large-scale e-Science applications. This paper describes an extension to Pegasus whereby resource allocation decisions are revised during workflow evaluation, in the light of feedback on the performance of jobs at runtime. The contributions of this paper include: (i) a description of how adaptive processing has been retrofitted to an existing workflow management system; (ii) a scheduling algorithm that allocates resources based on runtime performance; and (iii) an experimental evaluation of the resulting infrastructure using grid middleware over clusters. Copyright © 2009 John Wiley & Sons, Ltd.