A New Approach to Parallel Debugger Architecture

  • Authors:
  • Susanne M. Balle;Bevin R. Brett;Chih-Ping Chen;David LaFrance-Linden

  • Affiliations:
  • -;-;-;-

  • Venue:
  • PARA '02 Proceedings of the 6th International Conference on Applied Parallel Computing Advanced Scientific Computing
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

Modern computer systems are placing systems with hundreds even tens of thousands of CPUs in the hands of many researchers and commercial users. The debugging methods used on these systems are a combination of traditional and ad-hoc methods. Programmers are usually familiar with a serial debugger, and would like to use it to debug their distributed program. We present a set of modifications to a traditional debugger that makes it capable of debugging massively parallel applications. Our parallel debugger is composed of individual fully functional debuggers connected with an n-nary aggregating network to condense debugger outputs. This allows us to present the user with a global view of the application. The user can thereby easier see if a given parameter has a different value than expected and then focus on the problem. Experiments show that the debugger, both startup and users' command response time, is scalable to thousands of processors.