A probe effect in concurrent programs
Software—Practice & Experience
Perturbation analysis of high level instrumentation for SPMD programs
PPOPP '93 Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
Modeling and Evaluating Design Alternatives for an On-Line Instrumentation System: A Case Study
IEEE Transactions on Software Engineering
Semicoarsening Multigrid on Distributed Memory Machines
SIAM Journal on Scientific Computing
A tool framework for static and dynamic analysis of object-oriented software with templates
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Performance Measurement Intrusion and Perturbation Analysis
IEEE Transactions on Parallel and Distributed Systems
Application-Dependent Dynamic Monitoring of Distributed and Parallel Systems
IEEE Transactions on Parallel and Distributed Systems
An Adaptive Cost System for Parallel Program Instrumentation
Euro-Par '96 Proceedings of the Second International Euro-Par Conference on Parallel Processing - Volume I
A model for instrumentation system management in concurrent computer systems
HICSS '95 Proceedings of the 28th Hawaii International Conference on System Sciences
Monitoring Strategies for Hypercube Systems
PDP '96 Proceedings of the 4th Euromicro Workshop on Parallel and Distributed Processing (PDP '96)
Systematic Assessment of the Overhead of Tracing Parallel Programs
PDP '96 Proceedings of the 4th Euromicro Workshop on Parallel and Distributed Processing (PDP '96)
Toward Scalable Performance Visualization with Jumpshot
International Journal of High Performance Computing Applications
The Tau Parallel Performance System
International Journal of High Performance Computing Applications
MPI performance analysis tools on Blue Gene/L
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Measuring and characterizing system behavior using kernel-level event logging
ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
Scaling molecular dynamics to 3000 processors with projections: a performance analysis case study
ICCS'03 Proceedings of the 2003 international conference on Computational science
Trace-based parallel performance overhead compensation
HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
Evaluating similarity-based trace reduction techniques for scalable performance analysis
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
Scalable event trace visualization
Euro-Par'09 Proceedings of the 2009 international conference on Parallel processing
Trace profiling: Scalable event tracing on high-end parallel systems
Parallel Computing
Alignment-Based metrics for trace comparison
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Hi-index | 0.00 |
Although event tracing of parallel applications offers highly detailed performance information, tracing on current leading edge systems may lead to unacceptable perturbation of the target program and unmanageably large trace files. High end systems of the near future promise even greater scalability challenges. Development of more scalable approaches requires a detailed understanding of the interactions between current approaches and high end runtime environments. In this paper we present the results of studies that examine several sources of overhead related to tracing: instrumentation, differing trace buffer sizes, periodic buffer flushes to disk, system changes, and increasing numbers of processors in the target application. As expected, the overhead of instrumentation correlates strongly with the number of events; however, our results indicate that the contribution of writing the trace buffer increases with increasing numbers of processors. We include evidence that the total overhead of tracing is sensitive to the underlying file system.