Nested OpenMP parallelism allows an application to spawn teams of nested threads. This hierarchical nature of thread creation and usage poses problems for performance measurement tools, which must determine thread context to properly maintain per-thread performance data. In this paper we describe the problem and a novel solution for identifying threads uniquely. Our approach has been implemented in the TAU performance system and has been successfully used in profiling and tracing OpenMP applications with nested parallelism. We also describe how extensions to the OpenMP standard can help tool developers uniquely identify threads.
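To make the identification problem concrete: omp_get_thread_num() returns a number relative to the current team, so threads in two different nested teams can report the same value. The sketch below is one possible illustration and is not the method described in the paper or implemented in TAU. It assumes a tool can obtain, for each nesting level, the thread's team-relative number and the team size, and that all teams at a given level have the same size. The helper name flat_thread_index and its parameters are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper, not taken from the paper or from TAU.
 * Maps a thread's position in the nesting hierarchy to one flat index.
 *   path[i]      = team-relative thread number at nesting level i
 *   team_size[i] = number of threads in each team at level i (assumed uniform)
 * The index is a mixed-radix encoding of the path, so two distinct paths
 * never map to the same index under the uniform-size assumption. */
static size_t flat_thread_index(const int *path, const int *team_size, int depth)
{
    size_t index = 0;
    for (int level = 0; level < depth; ++level) {
        assert(team_size[level] > 0);
        assert(path[level] >= 0 && path[level] < team_size[level]);
        index = index * (size_t)team_size[level] + (size_t)path[level];
    }
    return index;
}
```

With an outer team of 2 threads and inner teams of 3, inner thread 1 of outer thread 0 and inner thread 1 of outer thread 1 both have team-relative number 1, but their paths {0, 1} and {1, 1} map to the distinct indices 1 and 4. In OpenMP versions that provide omp_get_level(), omp_get_ancestor_thread_num(), and omp_get_team_size(), the two arrays could plausibly be filled from those routines; whether that is suitable for a given measurement tool is left open here.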