Elemental: A New Framework for Distributed Memory Dense Matrix Computations

  • Authors:
  • Jack Poulson;Bryan Marker;Robert A. van de Geijn;Jeff R. Hammond;Nichols A. Romero

  • Affiliations:
  • University of Texas at Austin;University of Texas at Austin;University of Texas at Austin;Argonne Leadership Computing Facility;Argonne Leadership Computing Facility

  • Venue:
  • ACM Transactions on Mathematical Software (TOMS)
  • Year:
  • 2013

Quantified Score

Hi-index 0.01

Visualization

Abstract

Parallelizing dense matrix computations to distributed memory architectures is a well-studied subject and generally considered to be among the best understood domains of parallel computing. Two packages, developed in the mid 1990s, still enjoy regular use: ScaLAPACK and PLAPACK. With the advent of many-core architectures, which may very well take the shape of distributed memory architectures within a single processor, these packages must be revisited since the traditional MPI-based approaches will likely need to be extended. Thus, this is a good time to review lessons learned since the introduction of these two packages and to propose a simple yet effective alternative. Preliminary performance results show the new solution achieves competitive, if not superior, performance on large clusters.