Improving Compiler and Run-Time Support for Irregular Reductions Using Local Writes

  • Authors:
  • Hwansoo Han;Chau-Wen Tseng

  • Affiliations:
  • -;-

  • Venue:
  • LCPC '98 Proceedings of the 11th International Workshop on Languages and Compilers for Parallel Computing
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

Current compilers for distributed-memory multiprocessors parallelize irregular reductions either by generating calls to sophisticated run-time systems (CHAOS) or by relying on replicated buffers and the shared-memory interface supported by software DSMs (TreadMarks). We introduce LocalWrite, a new technique for parallelizing irregular reductions based on the owner-computes rule. It eliminates the need for buffers or synchronized writes, but may replicate computation. We investigate the impact of connectivity (node/edge ratio), locality (accesses to local data) and adaptivity (edge modifications) on their relative performance. LocalWrite improves performance by 50-150% compared to using replicated buffers, and can match or exceed gather/scatter for applications with low locality or high adaptivity.