High Performance Compilers for Parallel Computing
High Performance Compilers for Parallel Computing
Automatic intra-register vectorization for the Intel architecture
International Journal of Parallel Programming
SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance
WOMPAT '01 Proceedings of the International Workshop on OpenMP Applications and Tools: OpenMP Shared Memory Parallel Programming
IPDPS '03 Proceedings of the 17th International Symposium on Parallel and Distributed Processing
A C++ infrastructure for automatic introduction and translation of OpenMP directives
WOMPAT'03 Proceedings of the OpenMP applications and tools 2003 international conference on OpenMP shared memory parallel programming
A practical OpenMP compiler for system on chips
WOMPAT'03 Proceedings of the OpenMP applications and tools 2003 international conference on OpenMP shared memory parallel programming
Hi-index | 0.00 |
In this paper, we describe several compiler optimization techniques and their effect on the performance of OpenMP programs We elaborate on the major design considerations in a high performance OpenMP compiler and present experimental data based on the implementation of the optimizations in the Intel® C++ and Fortran compilers for Intel platforms Interactions of the OpenMP translation phase with other sequential optimizations in the compiler are discussed The techniques in this paper are responsible for achieving significant performance improvements on the industry standard SPEC* OMPM2001 and SPEC* OMPL2001 benchmarks, and these results are presented for Intel® Pentium® and Itanium® processor based systems.