Automatic resource-centric process migration for MPI

  • Authors:
  • Amnon Barak;Alexander Margolin;Amnon Shiloh

  • Affiliations:
  • Department of Computer Science, The Hebrew University of Jerusalem, Israel;Department of Computer Science, The Hebrew University of Jerusalem, Israel;Department of Computer Science, The Hebrew University of Jerusalem, Israel

  • Venue:
  • EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Process migration refers to the ability to move a running process from one node and make it continue on another. The MPI standard prescribes support for process migration, but so far it was implemented mostly via checkpoint-restart. This paper presents an automatic and transparent process migration framework that can be used for MPI processes. This framework is advantageous when migration of individual processes for purposes such as load-balancing is more adequate than checkpointing the whole job. The paper describes this framework for process migration in clusters and multi-clusters, how it was tuned for Open MPI and the performance of migrated MPI processes.