Executing concurrent loops on multiprocessor computers often requires synchronizing the processors. Synchronization schemes suitable for automatic problem decomposition are surveyed. The discussion assumes a shared-memory multiprocessor and the concurrent-loop paradigm, in which a loop is compiled so that each processor is assigned a different loop iteration. Topics covered are data dependence, removing synchronization points, random synchronization, pipelining, barrier synchronization, and critical sections.
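As a rough illustration of the concurrent-loop paradigm with a loop-carried data dependence, the sketch below assigns iterations to worker threads cyclically and makes iteration i wait for a signal from iteration i-1 before reading its result. It is not taken from the surveyed work: the function name `doacross_running_sum`, the `num_workers` parameter, the doubling step that stands in for independent per-iteration work, and the use of Python threads in place of processors are all assumptions made for the example.

```python
import threading


def doacross_running_sum(b, num_workers=4):
    """Compute a[i] = a[i-1] + 2*b[i] with iterations spread over threads.

    Hypothetical sketch: threads stand in for processors, and one
    threading.Event per iteration stands in for a post/wait primitive.
    """
    n = len(b)
    a = [0] * n
    # done[i] is set once a[i] has been written (the "post" for iteration i).
    done = [threading.Event() for _ in range(n)]

    def worker(worker_id):
        # Cyclic assignment: worker w runs iterations w, w + P, w + 2P, ...
        for i in range(worker_id, n, num_workers):
            # Work with no cross-iteration dependence can overlap freely.
            local = 2 * b[i]
            if i > 0:
                # Loop-carried dependence: wait until a[i-1] is available.
                done[i - 1].wait()
                a[i] = a[i - 1] + local
            else:
                a[i] = local
            # Signal the iteration that depends on this one.
            done[i].set()

    threads = [
        threading.Thread(target=worker, args=(w,)) for w in range(num_workers)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return a


if __name__ == "__main__":
    print(doacross_running_sum([1, 2, 3, 4, 5]))
```

Because each worker handles its iterations in increasing order and iteration 0 waits on nothing, every wait is eventually satisfied. Only the dependent part of each iteration is serialized; the independent part can overlap across workers, which is the kind of pipelining the abstract lists among its topics.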