On cluster resource allocation for multiple parallel task graphs

  • Authors:
  • Henri Casanova; Frédéric Desprez; Frédéric Suter

  • Affiliations:
  • Department of Information and Computer Sciences, University of Hawai'i at Manoa, USA; LIP UMR 5668, ENS Lyon, INRIA, CNRS, UCBL, University of Lyon, France; IN2P3 Computing Center, CNRS/IN2P3, Lyon-Villeurbanne, France

  • Venue:
  • Journal of Parallel and Distributed Computing
  • Year:
  • 2010

Abstract

Many scientific applications can be structured as parallel task graphs (PTGs), that is, graphs of data-parallel tasks. Adding data parallelism to a task-parallel application provides opportunities for higher performance and scalability, but poses additional scheduling challenges. In this paper, we study the off-line scheduling of multiple PTGs on a single, homogeneous cluster. The objective is to optimize performance without compromising fairness among the PTGs. We consider the range of previously proposed scheduling algorithms applicable to this problem, from both the applied and the theoretical literature, and we propose minor improvements when possible. Our main contribution is an extensive evaluation of these algorithms in simulation, on both synthetic and real-world application configurations, using two different metrics for performance and one metric for fairness. We identify a handful of algorithms that provide good trade-offs when considering all these metrics. The best algorithm overall is one that structures the schedule as a sequence of phases of increasing duration based on a makespan guarantee produced by an approximation algorithm.