LAPACK: a portable linear algebra library for high-performance computers
Proceedings of the 1990 ACM/IEEE conference on Supercomputing
Threshold pivoting for dense LU factorization on distributed memory multiprocessors
Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Stability of block algorithms with fast level-3 BLAS
ACM Transactions on Mathematical Software (TOMS)
Speedup of band linear recurrences in the presence of resource constraints
ICS '92 Proceedings of the 6th international conference on Supercomputing
A parallel block implementation of Level-3 BLAS for MIMD vector processors
ACM Transactions on Mathematical Software (TOMS)
MOB forms: a class of multilevel block algorithms for dense linear algebra operations
ICS '94 Proceedings of the 8th international conference on Supercomputing
IBM Journal of Research and Development
Data prefetching and multilevel blocking for linear algebra operations
ICS '96 Proceedings of the 10th international conference on Supercomputing
Block algorithms for sparse matrix computations on high performance workstations
ICS '96 Proceedings of the 10th international conference on Supercomputing
Computing Programs Containing Band Linear Recurrences on Vector Supercomputers
IEEE Transactions on Parallel and Distributed Systems
Highly Scalable Parallel Algorithms for Sparse Matrix Factorization
IEEE Transactions on Parallel and Distributed Systems
Auto-blocking matrix-multiplication or tracking BLAS3 performance from source code
PPOPP '97 Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming
Generating an Efficient Broadcast Sequence Using Reflected Gray Codes
IEEE Transactions on Parallel and Distributed Systems
Compiler blockability of dense matrix factorizations
ACM Transactions on Mathematical Software (TOMS)
A scalable parallel algorithm for sparse Cholesky factorization
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
A Columnwise Block Striping in Neville Elimination
PPAM '01 Proceedings of the th International Conference on Parallel Processing and Applied Mathematics-Revised Papers
Block-Striped Partitioning and Neville Elimination
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
A faster algorithm for solving linear algebraic equations on the star graph
Journal of Parallel and Distributed Computing
Parallel algorithms for Markov chain Monte Carlo methods in latent spatial Gaussian models
Statistics and Computing
Communication-efficient parallel generic pairwise elimination
Future Generation Computer Systems - Special section: Information engineering and enterprise architecture in distributed computing environments
Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines
Scientific Programming
International Journal of Computer Mathematics
Hi-index | 0.00 |