Event-based performance perturbation: a case study
PPOPP '91 Proceedings of the third ACM SIGPLAN symposium on Principles and practice of parallel programming
Models for performance perturbation analysis
PADD '91 Proceedings of the 1991 ACM/ONR workshop on Parallel and distributed debugging
Perturbation analysis of high level instrumentation for SPMD programs
PPOPP '93 Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
Performance technology for complex parallel and distributed systems
Distributed and parallel systems
Time, clocks, and the ordering of events in a distributed system
Communications of the ACM
Dynamic statistical profiling of communication activity in distributed applications
SIGMETRICS '02 Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Performance Measurement Intrusion and Perturbation Analysis
IEEE Transactions on Parallel and Distributed Systems
Performance observability
Advances in the TAU performance system
Performance analysis and grid computing
A Portable Programming Interface for Performance Evaluation on Modern Processors
International Journal of High Performance Computing Applications
Models for on-the-fly compensation of measurement overhead in parallel performance profiling
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
On-Line Performance Modeling for MPI Applications
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
On the Performance of Transparent MPI Piggyback Messages
Proceedings of the 15th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Hi-index | 0.00 |
Performance profiling of MPI programs generates overhead during execution that introduces error in profile measurements. It is possible to track and remove overhead online, but it is necessary to communicate execution delay between processes to correctly adjust their interdependent timing. We demonstrate the first implementation of a onlne measurement overhead compensation system for profiling MPI programs. This is implemented in the Tau performance systems. It requires novel techniques for delay communication in the use of MPI. The ability to reduce measurement error is demonstrated for problematic test cases and real applications.