Large-grain pipelining on hypercube multiprocessors

Authors:
C-T. King; L. M. Ni
Affiliations:
Department of Computer Science, Michigan State University, East Lansing, Michigan; Division of Mathematics and Computer Science, Argonne National Laboratory, Argonne, IL
Venue:
C3P Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2
Year:
1989

Abstract

Large-grain pipelining, a new paradigm for developing efficient parallel algorithms on distributed-memory multiprocessors such as hypercube machines, is introduced. Large-grain pipelining attempts to maximize the degree of overlap and minimize the effect of communication overhead in a multiprocessor system through macro-pipelining between the nodes. Algorithms developed through large-grain pipelining to perform matrix multiplication are presented. To model the pipelined computations, an analytic model is introduced that takes into account both the underlying architecture and the algorithm's behavior. Through the analytic model, important design parameters, such as data partition sizes, can be determined. Experiments were conducted on a 64-node NCUBE multiprocessor. The measured results closely match the results predicted by the model, which establishes the analytic model as an integral part of algorithm design. Comparison with an algorithm that does not use large-grain pipelining also shows that large-grain pipelining is an efficient scheme for achieving greater parallelism.
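The abstract's point that an analytic model can fix design parameters such as data partition sizes can be illustrated with a toy calculation. The sketch below is not the model from the paper; it is a generic textbook cost model for a linear macro-pipeline, and every name and parameter in it (`pipelined_time`, `best_block_count`, `t_comp`, `t_startup`, `t_per_item`) is a hypothetical chosen for illustration. It assumes each node computes a block and then forwards it, with a fixed message startup cost plus a per-item transfer cost.

```python
def pipelined_time(n_items, n_blocks, n_stages, t_comp, t_startup, t_per_item):
    """Estimated completion time of a linear macro-pipeline (toy model).

    Assumption: the data is split into n_blocks equal blocks, and each of
    the n_stages nodes computes a block and then sends it to its neighbor.
    """
    block = n_items / n_blocks
    # Cost of one pipeline step: compute the block, then pay the message
    # startup cost plus a transfer cost proportional to the block size.
    step = block * t_comp + t_startup + block * t_per_item
    # The first block needs n_stages steps to traverse the pipeline; the
    # remaining n_blocks - 1 blocks complete one step apart.
    return (n_stages + n_blocks - 1) * step


def best_block_count(n_items, n_stages, t_comp, t_startup, t_per_item):
    """Exhaustively search for the block count that minimizes the estimate."""
    return min(
        range(1, n_items + 1),
        key=lambda k: pipelined_time(
            n_items, k, n_stages, t_comp, t_startup, t_per_item
        ),
    )


if __name__ == "__main__":
    # Hypothetical numbers: 100 items, 5 nodes, unit compute cost per item,
    # message startup cost of 4 units, negligible per-item transfer cost.
    k = best_block_count(100, 5, 1.0, 4.0, 0.0)
    print(k, pipelined_time(100, k, 5, 1.0, 4.0, 0.0))  # prints: 10 196.0
```

Under these assumed costs, a single large block and 100 single-item blocks both give an estimate of 520 time units, while 10 blocks give 196. The trade-off is the usual one: few large blocks leave downstream nodes idle while the pipeline fills, and many small blocks pay the message startup cost too often.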