Portable profiling and tracing for parallel, scientific applications using C++
SPDT '98 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
Co-array Fortran for parallel programming
ACM SIGPLAN Fortran Forum
MPI-The Complete Reference, Volume 1: The MPI Core
MPI-The Complete Reference, Volume 1: The MPI Core
Automatic performance analysis of hybrid MPI/OpenMP applications
Journal of Systems Architecture: the EUROMICRO Journal - Special issue: Evolutions in parallel distributed and network-based processing
Performance Tool Support for MPI-2 on Linux
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
A Scalable Implementation of a Finite-Volume Dynamical Core in the Community Atmosphere Model
International Journal of High Performance Computing Applications
A performance measurement infrastructure for co-array fortran
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Specification of inefficiency patterns for MPI-2 one-sided communication
Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
A performance measurement infrastructure for co-array fortran
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Scalable detection of MPI-2 remote memory access inefficiency patterns
International Journal of High Performance Computing Applications
Hi-index | 0.00 |
To analyze the correctness and the performance of a program, information about the dynamic behavior of all participating processes is needed. The dynamic behavior can be modeled as a stream of events required for a later analysis including appropriate attributes. Based on this idea, kojak, a trace-based toolkit for performance analysis, records and analyzes the activities of mpi-1 point-to-point and collective communication. To support remote-memory access (rma) hardware in a portable way, mpi-2 introduced a standardized interface for remote memory access. However, potential performance gains come at the expense of more complex semantics. From a programmer's point of view, an mpi-2 data transfer is only completed after a sequence of communication and associated synchronization calls. This paper describes the integration of performance measurement and analysis methods for rma communication into the kojak toolkit. Special emphasis is put on the underlying event model used to represent the dynamic behavior of mpi-2 rma operations. We show that our model reflects the relationships between communication and synchronization more accurately than existing models. In addition, the model is general enough to also cover alternate but simpler rma interfaces, such as shmem and Co-Array Fortran.