Checkpointing and migration of communication channels in heterogeneous grid environments

  • Authors:
  • John Mehnert-Spahn;Michael Schoettner

  • Affiliations:
  • Heinrich-Heine University, Duesseldorf, NRW, Germany;Heinrich-Heine University, Duesseldorf, NRW, Germany

  • Venue:
  • ICA3PP'10 Proceedings of the 10th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

A grid checkpointing service providing migration and transparent fault tolerance is important for distributed and parallel applications executed in heterogeneous grids In this paper we address the challenges of checkpointing and migrating communication channels of grid applications executed on nodes equipped with different checkpointer packages We present a solution that is transparent for the applications and the underlying checkpointers It also allows using single node checkpointers for distributed applications The measurement numbers show only a small overhead especially with respect to large grid-applications where checkpointing may consume many minutes.