A new approach for automatic parallelization of blocked linear Algebra computations
Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Threshold pivoting for dense LU factorization on distributed memory multiprocessors
Proceedings of the 1991 ACM/IEEE conference on Supercomputing
On the parallelization of blocked LU factorization algorithms on distributed memory architectures
Proceedings of the 1992 ACM/IEEE conference on Supercomputing
Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines
Scientific Programming
Benchmarking GPUs to tune dense linear algebra
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Hi-index | 0.00 |