From trace generation to visualization: a performance framework for distributed parallel systems

  • Authors:
  • C. Eric Wu;Anthony Bolmarcich;Marc Snir;David Wootton;Farid Parpia;Anthony Chan;Ewing Lusk;William Gropp

  • Affiliations:
  • IBM T.J. Watson Research Center, P.O. Box 218, Yorktown Heights, NY;IBM T.J. Watson Research Center, P.O. Box 218, Yorktown Heights, NY;IBM T.J. Watson Research Center, P.O. Box 218, Yorktown Heights, NY;IBM RS6000 Division, 2455 South Road, Poughkeepsie, NY;IBM RS6000 Division, 2455 South Road, Poughkeepsie, NY;Mathematics and Computer Science Division, Angonne National Laboratory, Argonne, IL;Mathematics and Computer Science Division, Angonne National Laboratory, Argonne, IL;Mathematics and Computer Science Division, Angonne National Laboratory, Argonne, IL

  • Venue:
  • Proceedings of the 2000 ACM/IEEE conference on Supercomputing
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we describe a trace analysis framework, from trace generation to visualization. It includes a unified tracing facility on IBM SP systems, a self-defining interval file format, an API for framework extensions, utilities for merging and statistics generation, and a visualization tool with preview and multiple time-space diagrams. The trace environment is extremely scalable, and combines MPI events with system activities in the same set of trace files, one for each SMP node. Since the amount of trace data may be very large, utilities are developed to convert and merge individual trace files into a self-defining interval trace file with multiple frame directories. The interval format allows the development of multiple time-space diagrams, such as thread-activity view, processor-activity view, etc., from the same interval file. A visualization tool, Jumpshot, is modified to visualize these views. A statistics utility is developed using the API along with its graphics viewer.