Applicability of generic naming services and fault-tolerant metacomputing with FT-MPI

  • Authors:
  • David Dewolfs; Dawid Kurzyniec; Vaidy Sunderam; Jan Broeckhove; Tom Dhaene; Graham Fagg

  • Affiliations:
  • Depts. of Math and Computer Science, University of Antwerp, Belgium and Emory University, Atlanta, GA

  • Venue:
  • PVM/MPI'05: Proceedings of the 12th European PVM/MPI Users' Group Conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface
  • Year:
  • 2005

Abstract

There is growing interest in deploying MPI over multiple, heterogeneous, and geographically distributed resources to perform very large-scale computations. However, increasing the number of resources and their geographical spread creates problems with interoperability and fault tolerance. FT-MPI offers an interesting way to add fault tolerance to MPI, but it suffers from interoperability limitations and potential single points of failure when crossing multiple administrative domains. We propose to overcome these limitations by adding “pluggability” for one potential single point of failure, the name service used by FT-MPI, and by combining FT-MPI with the H2O metacomputing framework.