Cilk: an efficient multithreaded runtime system
PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
Parallel programming with MPI
Programming with POSIX threads
Programming with POSIX threads
Co-array Fortran for parallel programming
ACM SIGPLAN Fortran Forum
Parallel programming in OpenMP
Parallel programming in OpenMP
The High Performance FORTRAN Handbook
The High Performance FORTRAN Handbook
Parallel Programming and Parallel Abstractions in Fortress
Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques
X10: an object-oriented approach to non-uniform cluster computing
OOPSLA '05 Proceedings of the 20th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
Introduction to the cell multiprocessor
IBM Journal of Research and Development - POWER5 and packaging
Parallel Programmability and the Chapel Language
International Journal of High Performance Computing Applications
Intel threading building blocks
Intel threading building blocks
Numerical Simulation in Molecular Dynamics: Numerics, Algorithms, Parallelization, Applications
Numerical Simulation in Molecular Dynamics: Numerics, Algorithms, Parallelization, Applications
Data dependence analysis for the parallelization of numerical tree codes
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
Data parallel iterators for hierarchical grid and tree algorithms
Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
Hi-index | 0.00 |
There are several parallel programming models available for numerical computations at different levels of expressibility and ease of use. For the development of new domain specific programming models, a splitting into a distributed data container and parallel data iterators is proposed. Data distribution is implemented in application specific libraries. Data iterators are directly analysed and compiled automatically into parallel code. Target architectures of the source-to-source translation include shared (pthreads, Cell SPE), distributed memory (MPI) and hybrid programming styles. A model applications for grid based hierarchical numerical methods and an auto-parallelizing compiler are introduced.