Evaluation of mechanisms for fine-grained parallel programs in the J-machine and the CM-5

  • Authors:
  • Ellen Spertus;Seth Copen Goldstein;Klaus Erik Schauser;Thorsten von Eicken;David E. Culler;William J. Dally

  • Affiliations:
  • -;-;-;-;-;-

  • Venue:
  • ISCA '93 Proceedings of the 20th annual international symposium on computer architecture
  • Year:
  • 1993

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper uses an abstract machine approach to compare the mechanisms of two parallel machines: the J-Machine and the CM-5. High-level parallel programs are translated by a single optimizing compiler to a fine-grained abstract parallel machine, TAM. A final compilation step is unique to each machine and optimizes for specifics of the architecture. By determining the cost of the primitives and weighting them by their dynamic frequency in parallel programs, we quantify the effectiveness of the following mechanisms individually and in combination. Efficient processor/network coupling proves valuable. Message dispatch is found to be less valuable without atomic operations that allow the scheduling levels to cooperate. Multiple hardware contexts are of small value when the contexts cooperate and the compiler can partition the register set. Tagged memory provides little gain. Finally, the performance of the overall system is strongly influenced by the performance of the memory system and the frequency of control operations.