Parallel Arnoldi eigensolvers with enhanced scalability via global communications rearrangement

Authors:
V. Hernandez; J. E. Roman; A. Tomas
Affiliations:
D. Sistemas Informáticos y Computación, Universidad Politécnica de Valencia, Camino de Vera s/n, 46022 Valencia, Spain; D. Sistemas Informáticos y Computación, Universidad Politécnica de Valencia, Camino de Vera s/n, 46022 Valencia, Spain; D. Sistemas Informáticos y Computación, Universidad Politécnica de Valencia, Camino de Vera s/n, 46022 Valencia, Spain
Venue:
Parallel Computing
Year:
2007

Abstract

This paper presents several new variants of the single-vector Arnoldi algorithm for computing approximations to eigenvalues and eigenvectors of a non-symmetric matrix. The context of this work is the efficient implementation of industrial-strength, parallel, sparse eigensolvers, in which robustness, as well as efficiency, is of paramount importance. For this reason, Arnoldi variants that employ Gram-Schmidt with iterative reorthogonalization are considered. The proposed algorithms aim at improving scalability when running on massively parallel platforms. The main goal is to reduce the performance penalty induced by the global communications required in vector inner products and norms. In the proposed algorithms, this is achieved by reorganizing the stages that involve these operations, particularly the orthogonalization and normalization of vectors, in such a way that several global communications are grouped together while guaranteeing that the numerical stability of the process is maintained. The numerical properties of the new algorithms are assessed by means of a large set of test matrices. In addition, scalability analyses show a significant improvement in parallel performance.
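
The idea of grouping global communications can be illustrated with a small serial sketch. It is not the paper's algorithm: it shows one well-known way to merge the reduction for the inner products with the reduction for the norm in a single classical Gram-Schmidt pass, using the identity ||w - V h||^2 = ||w||^2 - ||h||^2, which holds when the basis vectors are orthonormal. The names `CountingComm`, `orthogonalize_separate` and `orthogonalize_grouped` are hypothetical, and `allreduce` only counts how many global reductions a distributed run would need; no reorthogonalization is included.

```python
import math


def dot(x, y):
    """Local inner product of two equally long vectors."""
    return sum(a * b for a, b in zip(x, y))


class CountingComm:
    """Hypothetical stand-in for a communicator: counts global reductions."""

    def __init__(self):
        self.reductions = 0

    def allreduce(self, values):
        # In a distributed run this would sum partial results across processes.
        self.reductions += 1
        return list(values)


def subtract_projection(basis, w, h):
    """Return w - sum_j h[j] * basis[j] (purely local work)."""
    return [wi - sum(hj * v[i] for hj, v in zip(h, basis))
            for i, wi in enumerate(w)]


def orthogonalize_separate(basis, w, comm):
    """One Gram-Schmidt pass with two reductions: coefficients, then norm."""
    h = comm.allreduce([dot(v, w) for v in basis])
    w = subtract_projection(basis, w, h)
    (sq,) = comm.allreduce([dot(w, w)])
    return w, h, math.sqrt(sq)


def orthogonalize_grouped(basis, w, comm):
    """Same pass with one reduction carrying the coefficients and ||w||^2."""
    *h, rho = comm.allreduce([dot(v, w) for v in basis] + [dot(w, w)])
    w = subtract_projection(basis, w, h)
    # Estimated squared norm; assumes the basis is orthonormal. Clamped at
    # zero because cancellation can make the difference slightly negative.
    sq = max(rho - dot(h, h), 0.0)
    return w, h, math.sqrt(sq)


if __name__ == "__main__":
    basis = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]
    w = [1.0, 2.0, 2.0, 4.0]
    for step in (orthogonalize_separate, orthogonalize_grouped):
        comm = CountingComm()
        _, _, norm = step(basis, w, comm)
        print(step.__name__, "reductions:", comm.reductions, "norm:", norm)
```

The estimated norm in the grouped version can lose accuracy through cancellation when `w` lies almost entirely in the span of the basis, which is the kind of stability concern the abstract refers to when it says the regrouping must preserve numerical stability.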