Design, Implementation, and Performance of Checkpointing in NetSolve

  • Authors:
  • Adnan Agbaria;James S. Plank

  • Affiliations:
  • -;-

  • Venue:
  • DSN '00 Proceedings of the 2000 International Conference on Dependable Systems and Networks (formerly FTCS-30 and DCCA-8)
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

While a variety of checkpointing techniques and systems has been documented for long-running programs, they are typically not available for programmers that are non-systems experts. This paper details a project that integrates three technologies, NetSolve, Starfish, and IBP, for the seamless integration of fault-tolerance into long-running applications. We discuss the design and implementation of this project, and present performance results executing on both local and wide-area networks.