Statistical methods for automatic performance bottleneck detection in MPI based programs
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part I
Measuring empirical computational complexity
Proceedings of the the 6th joint meeting of the European software engineering conference and the ACM SIGSOFT symposium on The foundations of software engineering
Automatic Communication Performance Debugging in PGAS Languages
Languages and Compilers for Parallel Computing
A runtime approach for estimating resource usage
Proceedings of the Fourth Symposium on Information and Communication Technology
Hi-index | 0.00 |
The question how well a MPI program is scaling with an increasing number of processors becomes more and more interesting, especially when these number grows to 10.000 or even 100.000 with IBM's 'Blue Gene' this year. The approach presented with this paper is able to identify locations within the source code of an application where the communication effort does not scale well with the growing number of processors. We show how traces for the same program generated with different numbers of processors can be inspected and compared automatically. An analytical approach will then identify the points within the source that do not scale as expected. At the end of this article, the benefits from this method are demonstrated on an ASCI benchmark.