StarPU-MPI: task programming over clusters of machines enhanced with accelerators

Authors:
Cédric Augonnet;Olivier Aumage;Nathalie Furmento;Raymond Namyst;Samuel Thibault
Affiliations:
NVIDIA Corporation, Santa Clara, California;Inria, Bordeaux, France;LaBRI, CNRS, University of Bordeaux, France;LaBRI, CNRS, University of Bordeaux, France;LaBRI, CNRS, University of Bordeaux, France
Venue:
EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface
Year:
2012

Citing 2
Cited 0

StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures

Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Data-Aware Task Scheduling on Multi-accelerator Based Platforms

ICPADS '10 Proceedings of the 2010 IEEE 16th International Conference on Parallel and Distributed Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

GPUs clusters are becoming widespread HPC platforms. Exploiting them is however challenging, as this requires two separate paradigms (MPI and CUDA or OpenCL) and careful load balancing due to node heterogeneity. Current paradigms usually either limit themselves to offload part of the computation and leave CPUs idle, or require static CPU/GPU work partitioning. We thus have previously proposed StarPU, a runtime system able to dynamically scheduling tasks within a single heterogeneous node. We show how we extended the task paradigm of StarPU with MPI to easily map the task graph on MPI clusters and automatically benefit from optimized execution.