The high performance Fortran handbook
The high performance Fortran handbook
Exploitation of symbolic information in interprocedural dependence analysis
Parallel Computing
A graph based approach to barrier synchronisation minimisation
ICS '97 Proceedings of the 11th international conference on Supercomputing
Maximizing parallelism and minimizing synchronization with affine transforms
Proceedings of the 24th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Thread fork/join techniques for multi-level parallelism exploitation in NUMA multiprocessors
ICS '99 Proceedings of the 13th international conference on Supercomputing
Parallelization of NAS Benchmarks for Shared Memory Multiprocessore
HPCN Europe 1998 Proceedings of the International Conference and Exhibition on High-Performance Computing and Networking
ICCS '02 Proceedings of the International Conference on Computational Science-Part II
Automation of Data Traffic Control on DSM Architectures
ICCS '01 Proceedings of the International Conference on Computational Science-Part II
Generating OpenMP code using an interactive parallelization environment
Parallel Computing - OpenMp
Nested parallelization of the flow solver TFS using the ParaWise parallelization environment
IWOMP'05/IWOMP'06 Proceedings of the 2005 and 2006 international conference on OpenMP shared memory parallel programming
An expert assistant for computer aided parallelization
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
WOMPAT'04 Proceedings of the 5th international conference on OpenMP Applications and Tools: shared Memory Parallel Programming with OpenMP
Automatic scoping of variables in parallel regions of an OpenMP program
WOMPAT'04 Proceedings of the 5th international conference on OpenMP Applications and Tools: shared Memory Parallel Programming with OpenMP
Hi-index | 0.00 |
The shared-memory programming model is a very effective way to achieve parallelism on shared memory parallel computers. As great progress was made in hardware and software technologies, performance of parallel programs with compiler directives has demonstrated large improvement. The introduction of OpenMP directives, the industrial standard for shared-memory programming, has minimized the issue of portability. In this study, we have extended CAPTools, a computer-aided parallelization toolkit, to automatically generate OpenMPbased parallel programs with nominal user assistance. We outline techniques used in the implementation of the tool and discuss the application of this tool on the NAS Parallel Benchmarks and several computational fluid dynamics codes. This work demonstrates the great potential of using the tool to quickly port parallel programs and also achieve good performance that exceeds some of the commercial tools.