ACM Transactions on Computer Systems (TOCS)
Processor Allocation in the Mesh Multiprocessors Using the Leapfrog Method
IEEE Transactions on Parallel and Distributed Systems
In multicomputer architectures where communication latency is distance-independent, thread placement is expected to have limited impact on an application's performance. This paper demonstrates that thread placement nonetheless affects application performance on a wormhole-routed multicomputer, the Intel Paragon. A communication-intensive synthetic workload is used to "stress test" the effect of placement-induced contention on communication latency. Experimentation and modeling show that thread placement patterns that minimize contention in the system's interconnection network improve performance. The analytic model and the experimental observations are in good agreement.