Using Strassen's algorithm to accelerate the solution of linear systems
The Journal of Supercomputing
ScaLAPACK user's guide
Fast runtime block cyclic data redistribution on multiprocessors
Journal of Parallel and Distributed Computing
Recursive array layouts and fast parallel matrix multiplication
Proceedings of the eleventh annual ACM symposium on Parallel algorithms and architectures
Tuning Strassen's matrix multiplication for memory efficiency
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Approaches for Integrating Task and Data Parallelism
IEEE Concurrency
Performance Prediction and Analysis of Parallel Out-Of-Core Matrix Factorization
HiPC '00 Proceedings of the 7th International Conference on High Performance Computing
Efficient Procedures for Using Matrix Algorithms
Proceedings of the 2nd Colloquium on Automata, Languages and Programming
GEEM-Based Level 3 BLAS: High-Performance Model Implementations and Performance Evaluation Benchmark
GEEM-Based Level 3 BLAS: High-Performance Model Implementations and Performance Evaluation Benchmark
Simultaneous exploitation of task and data parallelism in regular scientific applications
Simultaneous exploitation of task and data parallelism in regular scientific applications
Hi-index | 0.00 |