Open | SpeedShop: An open source infrastructure for parallel performance analysis

Authors:
Martin Schulz;Jim Galarowicz;Don Maghrak;William Hachfeld;David Montoya;Scott Cranford
Affiliations:
(Correspd. Lawrence Livermore National Laboratory, P.O. Box 808, L-560, Livermore, CA 94551, USA. Tel.: +1 925 423 6498/ E-mail: schulzm@llnl.gov) Lawrence Livermore National Laboratory, Livermore ...;Krell Insititute, Ames, IA, USA;Krell Insititute, Ames, IA, USA;Krell Insititute, Ames, IA, USA;Los Alamos National Laboratory, Los Alamos, NM, USA;Sandia National Laboratories, Livermore, CA, USA
Venue:
Scientific Programming - Large-Scale Programming Tools and Environments
Year:
2008

Citing 10
Cited 10

HPCVIEW: A Tool for Top-down Analysis of Node Performance

The Journal of Supercomputing
The Paradyn Parallel Performance Measurement Tool

Computer
hypre: A Library of High Performance Preconditioners

ICCS '02 Proceedings of the International Conference on Computational Science-Part III
The Dynamic Probe Class Library: An Infrastucture for Developing Instrumentation for Performance Tools

IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
SIGMA: a simulator infrastructure to guide memory analysis

Proceedings of the 2002 ACM/IEEE conference on Supercomputing
MRNet: A Software-Based Multicast/Reduction Network for Scalable Tools

Proceedings of the 2003 ACM/IEEE conference on Supercomputing
The Case of the Missing Supercomputer Performance: Achieving Optimal Performance on the 8,192 Processors of ASCI Q

Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Toward Scalable Performance Visualization with Jumpshot

International Journal of High Performance Computing Applications
An API for Runtime Code Patching

International Journal of High Performance Computing Applications
DynTG: a tool for interactive, dynamic instrumentation

ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part II

Lessons learned at 208K: towards debugging millions of cores

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Towards production monitoring of application progress

Proceedings of the 4th International Workshop on Software Engineering for Computational Science and Engineering
Automatic generation of executable communication specifications from parallel applications

Proceedings of the international conference on Supercomputing
Scalable fine-grained call path tracing

Proceedings of the international conference on Supercomputing
The VampirTrace plugin counter interface: introduction and examples

Euro-Par 2010 Proceedings of the 2010 conference on Parallel processing
Reducing the overhead of direct application instrumentation using prior static analysis

Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
Cache Conscious Task Regrouping on Multicore Processors

CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Novel views of performance data to analyze large-scale adaptive applications

SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
A new approach for performance analysis of openMP programs

Proceedings of the 27th international ACM conference on International conference on supercomputing
Alignment-Based metrics for trace comparison

Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Over the last decades a large number of performance tools has been developed to analyze and optimize high performance applications. Their acceptance by end users, however, has been slow: each tool alone is often limited in scope and comes with widely varying interfaces and workflow constraints, requiring different changes in the often complex build and execution infrastructure of the target application. We started the Open | SpeedShop project about 3 years ago to overcome these limitations and provide efficient, easy to apply, and integrated performance analysis for parallel systems. Open | SpeedShop has two different faces: it provides an interoperable tool set covering the most common analysis steps as well as a comprehensive plugin infrastructure for building new tools. In both cases, the tools can be deployed to large scale parallel applications using DPCL/Dyninst for distributed binary instrumentation. Further, all tools developed within or on top of Open | SpeedShop are accessible through multiple fully equivalent interfaces including an easy-to-use GUI as well as an interactive command line interface reducing the usage threshold for those tools.