LogP: towards a realistic model of parallel computation
PPOPP '93 Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
Software—Practice & Experience
Automated performance prediction of message-passing parallel programs
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Analyzing scheduling policies using Dimemas
Parallel Computing - Special double issue on environment and tools for parallel scientific computing
LogGP: incorporating long messages into the LogP model for parallel computation
Journal of Parallel and Distributed Computing
ICS '98 Proceedings of the 12th international conference on Supercomputing
Critical Path Profiling of Message Passing and Shared-Memory Programs
IEEE Transactions on Parallel and Distributed Systems
Time, clocks, and the ordering of events in a distributed system
Communications of the ACM
LogGPS: a parallel computational model for synchronization analysis
PPoPP '01 Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming
A Tool to Help Tune where Computation Is Performed
IEEE Transactions on Software Engineering
Performance coupling: case studies for improving the performance of scientific applications
Journal of Parallel and Distributed Computing
Visualizing the Performance of Parallel Programs
IEEE Software
HiPC '02 Proceedings of the 9th International Conference on High Performance Computing
MICCAI '01 Proceedings of the 4th International Conference on Medical Image Computing and Computer-Assisted Intervention
Prophesy: an infrastructure for performance analysis and modeling of parallel and grid applications
ACM SIGMETRICS Performance Evaluation Review
Gprof: A call graph execution profiler
SIGPLAN '82 Proceedings of the 1982 SIGPLAN symposium on Compiler construction
SvPablo: A Multi-Language Architecture-Independent Performance Analysis System
ICPP '99 Proceedings of the 1999 International Conference on Parallel Processing
Parallel Computing - Special issue: Parallel and distributed scientific and engineering computing
Toward Scalable Performance Visualization with Jumpshot
International Journal of High Performance Computing Applications
Hi-index | 0.00 |
This paper presents PerWiz, a performance prediction tool for improving the performance of message passing programs. PerWiz focuses on locating where a significant improvement can be achieved. To locate this, PerWiz performs a post-mortem analysis based on a realistic parallel computational model, LogGPS, so that predicts what performance will be achieved if the programs are modified according to typical tuning techniques, such as load balancing for a better workload distribution and message scheduling for a shorter waiting time. We also show two case studies where PerWiz played an important role in improving the performance of regular applications. Our results indicate that PerWiz is useful for application developers to assess the potential reduction in execution time that will be derived from program modification.