An extended set of FORTRAN basic linear algebra subprograms
ACM Transactions on Mathematical Software (TOMS)
A set of level 3 basic linear algebra subprograms
ACM Transactions on Mathematical Software (TOMS)
Partitioning sparse matrices with eigenvectors of graphs
SIAM Journal on Matrix Analysis and Applications
Fortran 90 explained
Stopping criteria for iterative solvers
SIAM Journal on Matrix Analysis and Applications
An improved spectral graph partitioning algorithm for mapping parallel computations
SIAM Journal on Scientific Computing
ACM Transactions on Mathematical Software (TOMS)
Object-oriented design of preconditioned iterative methods in diffpack
ACM Transactions on Mathematical Software (TOMS)
Iterative methods for solving linear systems
Iterative methods for solving linear systems
Level 3 basic linear algebra subprograms for sparse matrices: a user-level interface
ACM Transactions on Mathematical Software (TOMS)
Basic Linear Algebra Subprograms for Fortran Usage
ACM Transactions on Mathematical Software (TOMS)
Computers and Intractability: A Guide to the Theory of NP-Completeness
Computers and Intractability: A Guide to the Theory of NP-Completeness
PARA '95 Proceedings of the Second International Workshop on Applied Parallel Computing, Computations in Physics, Chemistry and Engineering Science
An Object-Oriented Environment for Sparse Parallel Computation on Adaptive Grids
IPPS '99/SPDP '99 Proceedings of the 13th International Symposium on Parallel Processing and the 10th Symposium on Parallel and Distributed Processing
LAPACK Working Note 94: A User''s Guide to the BLACS v1.0
LAPACK Working Note 94: A User''s Guide to the BLACS v1.0
An object-oriented framework for the development of scalable parallel multilevel preconditioners
ACM Transactions on Mathematical Software (TOMS)
Optimal sparse matrix dense vector multiplication in the I/O-model
Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures
On the development of PSBLAS-based parallel two-level Schwarz preconditioners
Applied Numerical Mathematics
Performance Optimization and Modeling of Blocked Sparse Kernels
International Journal of High Performance Computing Applications
Mercury: a reflective middleware for automatic parallelization of Bags-of-Tasks
Proceedings of the 8th International Workshop on Adaptive and Reflective MIddleware
Distributed nonnegative matrix factorization for web-scale dyadic data analysis on mapreduce
Proceedings of the 19th international conference on World wide web
ACM Transactions on Mathematical Software (TOMS)
Asynchronous Communication Schemes for Finite Difference Methods on Multiple GPUs
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Mind the gap!: bridging the dichotomy of design and implementation
Proceedings of the 4th International Workshop on Software Engineering for Computational Science and Engineering
HPCC'06 Proceedings of the Second international conference on High Performance Computing and Communications
FAST-EVP: an engine simulation tool
HPCC'05 Proceedings of the First international conference on High Performance Computing and Communications
Sparse matrix and solver objects for parallel finite element simulation of multi-field problems
HPCA'09 Proceedings of the Second international conference on High Performance Computing and Applications
Extending PSBLAS to build parallel schwarz preconditioners
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
Design patterns for scientific computations on sparse matrices
Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing
Object-Oriented Techniques for Sparse Matrix Computations in Fortran 2003
ACM Transactions on Mathematical Software (TOMS)
Performance analysis of parallel Schwarz preconditioners in the LES of turbulent channel flows
Computers & Mathematics with Applications
Design patterns for sparse-matrix computations on hybrid CPU/GPU platforms
Scientific Programming
Hi-index | 0.00 |
Many computationally intensive problems in engineering and science give rise to the solution of large, sparse, linear systems of equations. Fast and efficient methods for their soltion are very important because these systems usually occur in the innermost loop of the computational scheme. Parallelization is often necessary to achieve an acceptable level of performance. This paper presents the design, implementation, and interface of a library of Basic Linear Algebra Subroutines for sparse matrices (PSBLAS) which is specifically tailored to distributed-memory computers. PSBLAS enables easy, efficient, and portable implementations of parallel iterative solvers for linear systems. The interface keeps in view a Single Program Multiple Data programming model on distributed-memory machines. However, the architecture of the library does not exclude an implementation in different paradigms, such as those based on the shared-memory model.