A proposal for a set of level 3 basic linear algebra subprograms
ACM SIGNUM Newsletter
Solution of large, dense symmetric generalized eigenvalue problems using secondary storage
ACM Transactions on Mathematical Software (TOMS)
A parallel QR factorization algorithm using local pivoting
Proceedings of the 1988 ACM/IEEE conference on Supercomputing
A block QR factorization algorithm using restricted pivoting
Proceedings of the 1989 ACM/IEEE conference on Supercomputing
A set of level 3 basic linear algebra subprograms
ACM Transactions on Mathematical Software (TOMS)
LAPACK: a portable linear algebra library for high-performance computers
Proceedings of the 1990 ACM/IEEE conference on Supercomputing
Stability of block algorithms with fast level-3 BLAS
ACM Transactions on Mathematical Software (TOMS)
A parallel block implementation of Level-3 BLAS for MIMD vector processors
ACM Transactions on Mathematical Software (TOMS)
ICS '90 Proceedings of the 4th international conference on Supercomputing
Efficient Householder QR factorization for superscalar processors
ACM Transactions on Mathematical Software (TOMS)
The Journal of Supercomputing
Portable and efficient factorization algorithms on the IBM 3090/VF
ICS '89 Proceedings of the 3rd international conference on Supercomputing
Blocked algorithms and software for reduction of a regular matrix pair to generalized Schur form
ACM Transactions on Mathematical Software (TOMS)
Computational Economics - Computational Studies at Stanford
A framework for symmetric band reduction
ACM Transactions on Mathematical Software (TOMS)
Algorithm 807: The SBR Toolbox—software for successive band reduction
ACM Transactions on Mathematical Software (TOMS)
Parallel Out-of-Core Cholesky and QR Factorization with POOCLAPACK
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
High-Performance Library Software for QR Factorization
PARA '00 Proceedings of the 5th International Workshop on Applied Parallel Computing, New Paradigms for HPC in Industry and Academia
Using Pentangular Factorizations for the Reduction to Banded Form
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
An Efficient Parallel Algorithm to Solve Block-Toeplitz Systems
The Journal of Supercomputing
Parallel out-of-core computation and updating of the QR factorization
ACM Transactions on Mathematical Software (TOMS)
Accumulating Householder transformations, revisited
ACM Transactions on Mathematical Software (TOMS)
Algorithm 854: Fortran 77 subroutines for computing the eigenvalues of Hamiltonian matrices II
ACM Transactions on Mathematical Software (TOMS)
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Cache efficient bidiagonalization using BLAS 2.5 operators
ACM Transactions on Mathematical Software (TOMS)
Parallel block tridiagonalization of real symmetric matrices
Journal of Parallel and Distributed Computing
Updating the QR decomposition of block tridiagonal and block Hessenberg matrices
Applied Numerical Mathematics
Design and Implementation of the ScaLAPACK LU, QR, and Cholesky Factorization Routines
Scientific Programming
QR factorization for the Cell Broadband Engine
Scientific Programming - High Performance Computing with the Cell Broadband Engine
Applying recursion to serial and parallel QR factorization leads to better performance
IBM Journal of Research and Development
Scaling LAPACK panel operations using parallel cache assignment
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
Parallel algorithms for QR decomposition on a shared memory multiprocessor
Parallel Computing
Performance evaluation of parallel Gram-Schmidt re-orthogonalization methods
VECPAR'02 Proceedings of the 5th international conference on High performance computing for computational science
Implementing linear algebra routines on multi-core processors with pipelining and a look ahead
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
Parallel tiled QR factorization for multicore architectures
PPAM'07 Proceedings of the 7th international conference on Parallel processing and applied mathematics
High-performance up-and-downdating via Householder-like transformations
ACM Transactions on Mathematical Software (TOMS)
Algorithm 915, SuiteSparseQR: Multifrontal multithreaded rank-revealing sparse QR factorization
ACM Transactions on Mathematical Software (TOMS)
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
Concurrency and Computation: Practice & Experience
Divide and Conquer on Hybrid GPU-Accelerated Multicore Systems
SIAM Journal on Scientific Computing
Families of Algorithms for Reducing a Matrix to Condensed Form
ACM Transactions on Mathematical Software (TOMS)
Efficient generalized Hessenberg form and applications
ACM Transactions on Mathematical Software (TOMS)
Toward a scalable multi-GPU eigensolver via compute-intensive kernels and efficient communication
Proceedings of the 27th international ACM conference on International conference on supercomputing
Scaling LAPACK panel operations using parallel cache assignment
ACM Transactions on Mathematical Software (TOMS)
An improved parallel singular value algorithm and its implementation for multicore hardware
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
A multicore solution to Block-Toeplitz linear systems of equations
The Journal of Supercomputing