ACM Transactions on Mathematical Software (TOMS)
ASPLOS IV Proceedings of the fourth international conference on Architectural support for programming languages and operating systems
Software support for speculative loads
ASPLOS V Proceedings of the fifth international conference on Architectural support for programming languages and operating systems
Reducing memory latency via non-blocking and prefetching caches
ASPLOS V Proceedings of the fifth international conference on Architectural support for programming languages and operating systems
Design and evaluation of a compiler algorithm for prefetching
ASPLOS V Proceedings of the fifth international conference on Architectural support for programming languages and operating systems
Exploiting functional parallelism of POWER2 to design high-performance numerical algorithms
IBM Journal of Research and Development
Lattice QCD on the IBM scalable POWERParallel Systems SP2
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Data prefetching and multilevel blocking for linear algebra operations
ICS '96 Proceedings of the 10th international conference on Supercomputing
Improving the memory-system performance of sparse-matrix vector multiplication
IBM Journal of Research and Development
GEMM-based level 3 BLAS: high-performance model implementations and performance evaluation benchmark
ACM Transactions on Mathematical Software (TOMS)
Matrix multiplication: a case study of enhanced data cache utilization
Journal of Experimental Algorithmics (JEA)
Parallel and Fully Recursive Multifrontal Supernodal Sparse Cholesky
ICCS '02 Proceedings of the International Conference on Computational Science-Part II
Parallel and fully recursive multifrontal sparse Cholesky
Future Generation Computer Systems - Special issue: Selected numerical algorithms
Communication lower bounds for distributed-memory matrix multiplication
Journal of Parallel and Distributed Computing
A parallel symmetric block-tridiagonal divide-and-conquer algorithm
ACM Transactions on Mathematical Software (TOMS)
High Performance Implementation of Binomial Option Pricing
ICCSA '08 Proceeding sof the international conference on Computational Science and Its Applications, Part I
Cache-optimal algorithms for option pricing
ACM Transactions on Mathematical Software (TOMS)
Minimal-storage high-performance Cholesky factorization via blocking and recursion
IBM Journal of Research and Development
A matrix-type for performance–portability
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
Hi-index | 0.00 |