HPCVIEW: A Tool for Top-down Analysis of Node Performance
The Journal of Supercomputing
hypre: A Library of High Performance Preconditioners
ICCS '02 Proceedings of the International Conference on Computational Science-Part III
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
SIGMA: a simulator infrastructure to guide memory analysis
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
MRNet: A Software-Based Multicast/Reduction Network for Scalable Tools
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Toward Scalable Performance Visualization with Jumpshot
International Journal of High Performance Computing Applications
An API for Runtime Code Patching
International Journal of High Performance Computing Applications
DynTG: a tool for interactive, dynamic instrumentation
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part II
Lessons learned at 208K: towards debugging millions of cores
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Towards production monitoring of application progress
Proceedings of the 4th International Workshop on Software Engineering for Computational Science and Engineering
Automatic generation of executable communication specifications from parallel applications
Proceedings of the international conference on Supercomputing
Scalable fine-grained call path tracing
Proceedings of the international conference on Supercomputing
The VampirTrace plugin counter interface: introduction and examples
Euro-Par 2010 Proceedings of the 2010 conference on Parallel processing
Reducing the overhead of direct application instrumentation using prior static analysis
Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
Cache Conscious Task Regrouping on Multicore Processors
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Novel views of performance data to analyze large-scale adaptive applications
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
A new approach for performance analysis of openMP programs
Proceedings of the 27th international ACM conference on International conference on supercomputing
Alignment-Based metrics for trace comparison
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Hi-index | 0.00 |
Over the last decades a large number of performance tools has been developed to analyze and optimize high performance applications. Their acceptance by end users, however, has been slow: each tool alone is often limited in scope and comes with widely varying interfaces and workflow constraints, requiring different changes in the often complex build and execution infrastructure of the target application. We started the Open | SpeedShop project about 3 years ago to overcome these limitations and provide efficient, easy to apply, and integrated performance analysis for parallel systems. Open | SpeedShop has two different faces: it provides an interoperable tool set covering the most common analysis steps as well as a comprehensive plugin infrastructure for building new tools. In both cases, the tools can be deployed to large scale parallel applications using DPCL/Dyninst for distributed binary instrumentation. Further, all tools developed within or on top of Open | SpeedShop are accessible through multiple fully equivalent interfaces including an easy-to-use GUI as well as an interactive command line interface reducing the usage threshold for those tools.