Compiler-enhanced incremental checkpointing for OpenMP applications

  • Authors:
  • Greg Bronevetsky;Daniel J. Marques;Keshav K. Pingali;Radu Rugina;Sally A. McKee

  • Affiliations:
  • Lawrence Livermore National Laboratory, Livermore, CA, USA;University of Texas at Austin, Austin, TX, USA;University of Texas at Austin, Austin, TX, USA;Cornell University, Ithaca, NY, USA;Cornell University, Ithaca, NY, USA

  • Venue:
  • Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
  • Year:
  • 2008

Abstract

As modern supercomputing systems reach petaflop performance, they grow in both size and complexity and become increasingly vulnerable to failures. Checkpointing is a popular technique for tolerating such failures. Although a variety of automated system-level checkpointing solutions are currently available to HPC users, manual application-level checkpointing remains more popular because of its superior performance. This paper improves the performance of automated checkpointing by presenting a compiler analysis for incremental checkpointing. The analysis, which works with both sequential and OpenMP applications, significantly reduces checkpoint sizes and enables asynchronous checkpointing.
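
To make the idea of incremental checkpointing concrete, the following is a minimal sequential sketch in C. It is not the paper's compiler analysis: it assumes a simple runtime scheme in which every write marks its block as dirty, and a checkpoint copies only the dirty blocks. All names (`ckpt_state`, `ckpt_init`, `ckpt_write`, `ckpt_take`) and the block sizes are hypothetical, chosen only for illustration, and an in-memory array stands in for real checkpoint storage.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical sizes, chosen only for this illustration. */
#define NUM_BLOCKS 4
#define BLOCK_LEN  8

typedef struct {
    double data[NUM_BLOCKS][BLOCK_LEN];   /* live application state */
    double saved[NUM_BLOCKS][BLOCK_LEN];  /* stand-in for checkpoint storage */
    int    dirty[NUM_BLOCKS];             /* 1 if block changed since last checkpoint */
} ckpt_state;

/* Zero everything and mark all blocks dirty so the first
 * checkpoint is a full one. */
void ckpt_init(ckpt_state *s) {
    memset(s, 0, sizeof *s);
    for (size_t b = 0; b < NUM_BLOCKS; b++) {
        s->dirty[b] = 1;
    }
}

/* Application writes go through this helper so that the modified
 * block is recorded. (Assumption: a compiler-based approach would
 * determine such information statically instead of at run time.) */
void ckpt_write(ckpt_state *s, size_t block, size_t idx, double value) {
    assert(block < NUM_BLOCKS && idx < BLOCK_LEN);
    s->data[block][idx] = value;
    s->dirty[block] = 1;
}

/* Take an incremental checkpoint: copy only the blocks modified
 * since the previous checkpoint. Returns the number of blocks copied. */
size_t ckpt_take(ckpt_state *s) {
    size_t copied = 0;
    for (size_t b = 0; b < NUM_BLOCKS; b++) {
        if (!s->dirty[b]) {
            continue;  /* unchanged since last checkpoint: skip */
        }
        memcpy(s->saved[b], s->data[b], sizeof s->data[b]);
        s->dirty[b] = 0;
        copied++;
    }
    return copied;
}
```

In this sketch the first call to `ckpt_take` copies every block, while later calls copy only the blocks touched through `ckpt_write` in the meantime, which is the source of the reduction in checkpoint size that incremental schemes aim for.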