Bringing Reverse Debugging to HPC

  • Authors: Chris Gottbrath

  • Affiliations: TotalView Technologies, Natick 01760

  • Venue: Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
  • Year: 2009

Abstract

Reverse debugging is a technique for troubleshooting and analyzing software that lets developers work backward from a software failure directly to the source code error that caused it. ReplayEngine makes this technique available in high-performance computing (HPC) environments. This paper examines the challenges we face, and the solutions we are pursuing, as we develop ReplayEngine into a mature HPC reverse debugging solution.