Optimal Hot Spot Allocation on Meshes for Large-Scale Data-Parallel Algorithms
IEEE Transactions on Parallel and Distributed Systems
Hot spots are notorious for degrading the performance of parallel algorithms. In this paper, we attempt to minimize the hot-spot access time for a class of problems, namely Large-Scale Data-Parallel (LSDP) algorithms, on a 2D mesh. An LSDP algorithm has rich data parallelism but lacks exclusive task and data partitioning. Our approach is to allocate the hot spots at optimal locations so that the hot-spot access time is minimized. We have also designed scheduling algorithms that control hot-spot access sequences to achieve the minimal access time. Both uniform and nonuniform hot spots are considered in this study. We have analytically derived the optimal allocations for wrapped-around and non-wrapped-around square meshes. The theoretical results have been verified by parallelizing the EM algorithm for 3D PET image reconstruction on the Intel iPSC/860.
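To make the allocation question concrete, the following is a toy sketch and not the paper's method. It assumes a deliberately simplified cost model in which the access cost of a hot spot is the sum of hop distances from every node of an n x n mesh to the hot spot, ignoring contention and access scheduling. The function names (`hop_distance`, `total_access_cost`, `best_locations`) are hypothetical and chosen only for illustration.

```python
from itertools import product


def hop_distance(a, b, n, wrap):
    """Hop count between nodes a and b on an n x n mesh (assumed cost model)."""
    total = 0
    for x, y in zip(a, b):
        d = abs(x - y)
        if wrap:
            # On a wrapped-around mesh, a message may take the shorter way round.
            d = min(d, n - d)
        total += d
    return total


def total_access_cost(spot, n, wrap):
    """Sum of hop distances from every node to a single hot spot at `spot`."""
    nodes = product(range(n), repeat=2)
    return sum(hop_distance(node, spot, n, wrap) for node in nodes)


def best_locations(n, wrap):
    """Brute-force search: return (list of minimizing locations, minimal cost)."""
    costs = {
        spot: total_access_cost(spot, n, wrap)
        for spot in product(range(n), repeat=2)
    }
    best = min(costs.values())
    return sorted(s for s, c in costs.items() if c == best), best


if __name__ == "__main__":
    for wrap in (False, True):
        spots, cost = best_locations(5, wrap)
        print(f"wrap={wrap}: {len(spots)} optimal location(s), cost={cost}")
```

Under this simplified model, a non-wrapped 5 x 5 mesh has a single best location at the center node (2, 2), whereas on the wrapped-around mesh every node ties by symmetry. This hints at why the two mesh types call for separate analysis; the paper's results additionally account for access time and scheduling, which this sketch does not model.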