Co-array Fortran for parallel programming
ACM SIGPLAN Fortran Forum
Statistical scalability analysis of communication operations in distributed applications
PPoPP '01 Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming
HPCVIEW: A Tool for Top-down Analysis of Node Performance
The Journal of Supercomputing
Visualizing the Performance of Parallel Programs
IEEE Software
DiP: A Parallel Program Development Environment
Euro-Par '96 Proceedings of the Second International Euro-Par Conference on Parallel Processing-Volume II
SvPablo: A Multi-language Performance Analysis System
TOOLS '98 Proceedings of the 10th International Conference on Computer Performance Evaluation: Modelling Techniques and Tools
The role of instrumentation and mapping in performance measurement
The role of instrumentation and mapping in performance measurement
The Tau Parallel Performance System
International Journal of High Performance Computing Applications
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
GASP! a standardized performance analysis tool interface for global address space programming models
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
Platform-aware bottleneck detection for reconfigurable computing applications
ACM Transactions on Reconfigurable Technology and Systems (TRETS)
Improving UPC productivity via integrated development tools
Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model
Exploring cross-layer power management for PGAS applications on the SCC platform
Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
Performance analysis of SCOOP programs
Journal of Systems and Software
Understanding the formation of wait states in applications with one-sided communication
Proceedings of the 20th European MPI Users' Group Meeting
Hi-index | 0.01 |
Given the complexity of high-performance parallel programs, developers often must rely on performance analysis tools to help them improve the performance of their applications. While many tools support analysis of message-passing programs, tool support is limited for applications written in programming models that present a partitioned global address space (PGAS) to the programmer such as UPC and SHMEM. Existing tools that support message-passing models are difficult to extend to support PGAS models due to differences between the two paradigms and the techniques used in their implementations. In this paper, we present our work on Parallel Performance Wizard (PPW), a performance analysis system for PGAS and MPI application analysis. We discuss new concepts, namely the generic-operation-type abstraction and GASP-enabled data collection, developed to facilitate support for multiple programming models and then give an overview of PPWâ聙聶s automatic analysis and visualization capabilities. Finally, to show the usefulness of our system, we present results on PPWâ聙聶s overhead, storage requirements and scalability before demonstrating its effectiveness via application case studies.