A Scalable Non-blocking Multicast Scheme for Distributed DAG Scheduling

  • Authors:
  • Fengguang Song; Jack Dongarra; Shirley Moore

  • Affiliations:
  • EECS Department, University of Tennessee, Knoxville, USA; EECS Department, University of Tennessee, Knoxville, USA and Oak Ridge National Laboratory, Oak Ridge, USA; EECS Department, University of Tennessee, Knoxville, USA

  • Venue:
  • ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
  • Year:
  • 2009

Abstract

This paper presents an application-level non-blocking multicast scheme for dynamic DAG scheduling on large-scale distributed-memory systems. To achieve scalability, the scheme takes into account both the network topology and the space requirements of its routing tables. We prove that the scheme is deadlock-free and completes in at most log N steps. Each routing table selects which neighbors to store based on topology IDs and occupies only O(log N) space. Although it is built on MPI point-to-point operations, the experimental results show that our scheme significantly outperforms the simple flat-tree method and is comparable to the vendor's collective MPI operations.
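To give a feel for the log N bound and the O(log N) routing-table size, here is a minimal sketch of a generic binomial-tree multicast, simulated in plain Python. It is not the authors' scheme: the abstract does not spell out how topology IDs pick the neighbors, so the rank numbering, the root at rank 0, and the helper names `binomial_children` and `multicast_rounds` are all assumptions made for illustration. By contrast, a flat tree has the root send to the other N - 1 nodes one after another.

```python
def binomial_children(rank, n):
    """Children of `rank` in a binomial tree rooted at rank 0 over n nodes.

    Hypothetical helper: the returned list plays the role of a per-node
    routing table and holds at most ceil(log2(n)) entries.
    """
    step = 1
    # Skip the offsets used before this node has received the message.
    while step <= rank:
        step <<= 1
    children = []
    while rank + step < n:
        children.append(rank + step)
        step <<= 1
    return children


def multicast_rounds(n):
    """Count the rounds needed until all n nodes hold the message.

    In each round, every informed node forwards to one new node, so the
    informed set roughly doubles per round.
    """
    informed = {0}
    rounds = 0
    step = 1
    while len(informed) < n:
        informed |= {r + step for r in informed if r + step < n}
        step <<= 1
        rounds += 1
    return rounds


if __name__ == "__main__":
    n = 1000
    table_size = max(len(binomial_children(r, n)) for r in range(n))
    print("binomial-tree rounds:", multicast_rounds(n))
    print("flat-tree sends by root:", n - 1)
    print("largest routing table:", table_size)
```

Under these assumptions, the number of rounds and the largest per-node table both grow logarithmically with the number of nodes, whereas the flat tree's cost at the root grows linearly.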