IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
An API for Runtime Code Patching
International Journal of High Performance Computing Applications
Memory Allocation Tracing with VampirTrace
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part II
Performance cockpit: an extensible GUI platform for performance tools
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Hi-index | 0.00 |
We developed an automated environment to support the analysis of memory access behaviors of applications on high performance clusters. Code optimization targeting efficient use of processor caches is crucial for achieving good performance on such systems. Our environment is able to selectively instrument OpenMP Fortran95 programs upon requests of programmer. The monitor can be configured to collect hardware counter information on specified code regions. Limitations due to the number of available physical hardware counters are automatically taken into account. The whole environment is controlled through a friendly user interface based on Eclipse and is highly portable.