ICS '90 Proceedings of the 4th international conference on Supercomputing
This paper describes a series of experiments performed with block versions of the LU, Cholesky and QR factorizations using Level 3 BLAS on one processor of the IBM 3090/VF. We show that the LAPACK approach to designing linear algebra software that is both portable and efficient, namely calling optimized versions of the Level 3 BLAS kernels, is likely to be a successful way to exploit machines with hierarchical memories.
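
As an illustration of the block approach described above, here is a minimal sketch of a right-looking block Cholesky factorization in pure Python. It is not the LAPACK or IBM 3090/VF code from the experiments. The helper names `potf2`, `trsm_panel`, and `syrk_update` are hypothetical stand-ins, chosen to echo the unblocked factorization and the Level 3 BLAS kernels (triangular solve and symmetric rank-k update) that an optimized library would supply. The block size `nb` is likewise an assumed parameter.

```python
import math


def potf2(a, k, kb):
    """Unblocked Cholesky of the kb-by-kb diagonal block starting at (k, k)."""
    for j in range(k, k + kb):
        d = a[j][j] - sum(a[j][p] ** 2 for p in range(k, j))
        if d <= 0.0:
            raise ValueError("matrix is not positive definite")
        a[j][j] = math.sqrt(d)
        for i in range(j + 1, k + kb):
            s = sum(a[i][p] * a[j][p] for p in range(k, j))
            a[i][j] = (a[i][j] - s) / a[j][j]


def trsm_panel(a, k, kb, n):
    """TRSM-like step: solve X * L11^T = A21 and overwrite A21 with X."""
    for i in range(k + kb, n):
        for j in range(k, k + kb):
            s = sum(a[i][p] * a[j][p] for p in range(k, j))
            a[i][j] = (a[i][j] - s) / a[j][j]


def syrk_update(a, k, kb, n):
    """SYRK-like step: A22 -= L21 * L21^T, touching the lower triangle only."""
    for i in range(k + kb, n):
        for j in range(k + kb, i + 1):
            a[i][j] -= sum(a[i][p] * a[j][p] for p in range(k, k + kb))


def blocked_cholesky(matrix, nb=2):
    """Return lower-triangular L with L * L^T = matrix, working nb columns at a time."""
    n = len(matrix)
    a = [[float(x) for x in row] for row in matrix]  # work on a copy
    for k in range(0, n, nb):
        kb = min(nb, n - k)
        potf2(a, k, kb)        # small diagonal block: unblocked code
        trsm_panel(a, k, kb, n)   # panel below the diagonal block
        syrk_update(a, k, kb, n)  # trailing submatrix: bulk of the arithmetic
    # Only the lower triangle was updated, so zero the strict upper triangle.
    return [[a[i][j] if j <= i else 0.0 for j in range(n)] for i in range(n)]
```

For example, `blocked_cholesky([[4, 2, 2], [2, 5, 3], [2, 3, 6]], nb=2)` yields the factor with rows `[2, 0, 0]`, `[1, 2, 0]`, `[1, 1, 2]`. In this arrangement most of the floating-point work falls in the trailing update, which is a matrix-matrix operation; that is the property a tuned Level 3 BLAS kernel can exploit on a machine with a hierarchical memory.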