Portable performance of data parallel languages

Authors:
Ton Ngo; Lawrence Snyder; Bradford Chamberlain
Affiliations:
IBM T. J. Watson Research Center, Yorktown Heights, NY; University of Washington, Seattle, WA; University of Washington, Seattle, WA
Venue:
SC '97 Proceedings of the 1997 ACM/IEEE conference on Supercomputing
Year:
1997

Abstract

A portable program executes on different platforms and yields consistent performance. With a focus on portability, this paper presents an in-depth study of the performance of three NAS benchmarks (EP, MG, FT) compiled with three commercial HPF compilers (APR, PGI, IBM) on the IBM SP2. Each benchmark is evaluated in two versions: one using DO loops and one using F90 constructs and/or HPF's Forall statement. Baseline comparisons are provided by versions of the benchmarks written in Fortran/MPI and in ZPL, a data parallel language developed at the University of Washington.

While some F90/Forall programs achieve scalable performance with some compilers, the results indicate a considerable portability problem in HPF programs. Two sources of the problem are identified. First, Fortran's semantics require extensive analysis and optimization to arrive at a parallel program; relying on the compiler's capability alone therefore leads to unpredictable performance. Second, the wide differences in the parallelization strategies used by the compilers may require an HPF program to be customized for a particular compiler. While improving compiler optimizations may help reduce some performance variations, the results suggest that the foremost criterion for portability is a concise performance model that the compiler must adhere to and that users can rely on.