High Performance Matrix Inversion on a Multi-core Platform with Several GPUs

  • Authors:
  • Pablo Ezzatti;Enrique S. Quintana-Orti;Alfredo Remon

  • Affiliations:
  • -;-;-

  • Venue:
  • PDP '11 Proceedings of the 2011 19th International Euromicro Conference on Parallel, Distributed and Network-Based Processing
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Inversion of large-scale matrices appears in a few scientific applications like model reduction or optimal control. Matrix inversion requires an important computational effort and, therefore, the application of high performance computing techniques and architectures for matrices with dimension in the order of thousands. Following the recent up rise of graphics processors (GPUs), we present and evaluate high performance codes for matrix inversion, based on Gauss-Jordan elimination with partial pivoting, which off-load the main computational kernels to one or more GPUs while performing fine-grain operations on the general-purpose processor. The target architecture consists of a multi-core processor connected to several GPUs. Parallelism is extracted from parallel implementations of BLAS and from the concurrent execution of operations in the available computational units. Numerical experiments on a system with two Intel Quad Core processors and four NVIDIA c1060 GPUs illustrate the efficiency and the scalability of the different implementations, which deliver over 1.2脳1012 floating point operations per second.