PSBLAS: a library for parallel linear algebra computation on sparse matrices

Authors:
Salvatore Filippone;Michele Colajanni
Affiliations:
Univ. di Roma Tor Vergata, Rome, Italy;Univ. di Modena e Reggio Emilia, Modena, Italy
Venue:
ACM Transactions on Mathematical Software (TOMS)
Year:
2000

Citing 15
Cited 19

An extended set of FORTRAN basic linear algebra subprograms

ACM Transactions on Mathematical Software (TOMS)
A set of level 3 basic linear algebra subprograms

ACM Transactions on Mathematical Software (TOMS)
Partitioning sparse matrices with eigenvectors of graphs

SIAM Journal on Matrix Analysis and Applications
Fortran 90 explained

Fortran 90 explained
Stopping criteria for iterative solvers

SIAM Journal on Matrix Analysis and Applications
An improved spectral graph partitioning algorithm for mapping parallel computations

SIAM Journal on Scientific Computing
Fortran 90: an entry to object-oriented programming for the solution of partial differential equations

ACM Transactions on Mathematical Software (TOMS)
Object-oriented design of preconditioned iterative methods in diffpack

ACM Transactions on Mathematical Software (TOMS)
Iterative methods for solving linear systems

Iterative methods for solving linear systems
Level 3 basic linear algebra subprograms for sparse matrices: a user-level interface

ACM Transactions on Mathematical Software (TOMS)
Basic Linear Algebra Subprograms for Fortran Usage

ACM Transactions on Mathematical Software (TOMS)
Computers and Intractability: A Guide to the Theory of NP-Completeness

Computers and Intractability: A Guide to the Theory of NP-Completeness
ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory Computers - Design Issues and Performance

PARA '95 Proceedings of the Second International Workshop on Applied Parallel Computing, Computations in Physics, Chemistry and Engineering Science
An Object-Oriented Environment for Sparse Parallel Computation on Adaptive Grids

IPPS '99/SPDP '99 Proceedings of the 13th International Symposium on Parallel Processing and the 10th Symposium on Parallel and Distributed Processing
LAPACK Working Note 94: A User''s Guide to the BLACS v1.0

LAPACK Working Note 94: A User''s Guide to the BLACS v1.0

An object-oriented framework for the development of scalable parallel multilevel preconditioners

ACM Transactions on Mathematical Software (TOMS)
Optimal sparse matrix dense vector multiplication in the I/O-model

Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures
On the development of PSBLAS-based parallel two-level Schwarz preconditioners

Applied Numerical Mathematics
Performance Optimization and Modeling of Blocked Sparse Kernels

International Journal of High Performance Computing Applications
Mercury: a reflective middleware for automatic parallelization of Bags-of-Tasks

Proceedings of the 8th International Workshop on Adaptive and Reflective MIddleware
Distributed nonnegative matrix factorization for web-scale dyadic data analysis on mapreduce

Proceedings of the 19th international conference on World wide web
MLD2P4: A Package of Parallel Algebraic Multilevel Domain Decomposition Preconditioners in Fortran 95

ACM Transactions on Mathematical Software (TOMS)
Asynchronous Communication Schemes for Finite Difference Methods on Multiple GPUs

CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Hogs and slackers: Using operations balance in a genetic algorithm to optimize sparse algebra computation on distributed architectures

Parallel Computing
Mind the gap!: bridging the dichotomy of design and implementation

Proceedings of the 4th International Workshop on Software Engineering for Computational Science and Engineering
An enhanced parallel version of kiva–3v, coupled with a 1d CFD code, and its use in general purpose engine applications

HPCC'06 Proceedings of the Second international conference on High Performance Computing and Communications
FAST-EVP: an engine simulation tool

HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
Sparse matrix and solver objects for parallel finite element simulation of multi-field problems

HPCA'09 Proceedings of the Second international conference on High Performance Computing and Applications
Extending PSBLAS to build parallel schwarz preconditioners

PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
Design patterns for scientific computations on sparse matrices

Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing
Object-Oriented Techniques for Sparse Matrix Computations in Fortran 2003

ACM Transactions on Mathematical Software (TOMS)
Performance analysis of parallel Schwarz preconditioners in the LES of turbulent channel flows

Computers & Mathematics with Applications
On the Easy Use of Scientific Computing Services for Large Scale Linear Algebra and Parallel Decision Making with the P-Grade Portal

Journal of Grid Computing
Design patterns for sparse-matrix computations on hybrid CPU/GPU platforms

Scientific Programming

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many computationally intensive problems in engineering and science give rise to the solution of large, sparse, linear systems of equations. Fast and efficient methods for their soltion are very important because these systems usually occur in the innermost loop of the computational scheme. Parallelization is often necessary to achieve an acceptable level of performance. This paper presents the design, implementation, and interface of a library of Basic Linear Algebra Subroutines for sparse matrices (PSBLAS) which is specifically tailored to distributed-memory computers. PSBLAS enables easy, efficient, and portable implementations of parallel iterative solvers for linear systems. The interface keeps in view a Single Program Multiple Data programming model on distributed-memory machines. However, the architecture of the library does not exclude an implementation in different paradigms, such as those based on the shared-memory model.