Adapting a parallel sparse direct solver to architectures with clusters of SMPs

  • Authors:
  • Patrick R. Amestoy;Iain S. Duff;Stéphane Pralet;Christof Vömel

  • Affiliations:
  • ENSEEIHT, 2 rue Camichel, BP 7122--F 31071 Toulouse Cedex 7, France;CERFACS, Toulouse, and Atlas Centre, RAL, Oxon OX11 0QX, UK;CERFACS, 42, av. G. Coriolis, 31057 Toulouse Cedex 01, France;CERFACS, 42, av. G. Coriolis, 31057 Toulouse Cedex 01, France

  • Venue:
  • Parallel Computing - Special issue: Parallel and distributed scientific and engineering computing
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

We consider the direct solution of general sparse linear systems baseds on a multifrontal method. The approach combines partial static scheduling of the task dependency graph during the symbolic factorization and distributed dynamic scheduling during the numerical factorization to balance the work among the processes of a distributed memory computer. We show that to address clusters of Symmetric Multi-Processor (SMP) architectures, and more generally non-uniform memory access multiprocessors, our algorithms for both the static and the dynamic scheduling need to be revisited to take account of the non-uniform cost of communication. The performance analysis on an IBM SP3 with 16 processors per SMP node and up to 128 processors shows that we can significantly reduce both the amount of inter-node communication and the solution time.