A generalized eigensystem problem is usually transformed, by means of Cholesky decomposition, into a standard eigenproblem. The latter is then solved efficiently by a matrix reduction approach based on the Householder tridiagonalization method. We present a parallel implementation of an integrated transformation-reduction algorithm on a GPU accelerator using CUBLAS. Experimental results clearly demonstrate the potential of data-parallel coprocessors for scientific computations. Compared with the CPU implementation, the GPU implementations achieve speedups of more than 16-fold for reduction and more than 26-fold for transformation, both in double precision.
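The two stages named in the abstract can be illustrated with a small serial sketch. It is not the paper's CUBLAS implementation: it is plain Python on nested lists, and the function names (`cholesky`, `forward_solve`, `to_standard`, `householder_tridiagonalize`) are hypothetical, chosen only for this illustration. It assumes A is symmetric and B is symmetric positive definite, so that with B = L L^T the problem A x = lambda B x becomes C y = lambda y, where C = L^{-1} A L^{-T} and y = L^T x. C is then reduced to a tridiagonal matrix T = Q^T C Q by unblocked Householder reflections.

```python
import math


def cholesky(B):
    """Return lower-triangular L with B = L L^T (B assumed symmetric positive definite)."""
    n = len(B)
    L = [[0.0] * n for _ in range(n)]
    for j in range(n):
        d = B[j][j] - sum(L[j][k] ** 2 for k in range(j))
        L[j][j] = math.sqrt(d)
        for i in range(j + 1, n):
            s = B[i][j] - sum(L[i][k] * L[j][k] for k in range(j))
            L[i][j] = s / L[j][j]
    return L


def forward_solve(L, b):
    """Solve L x = b for lower-triangular L by forward substitution."""
    x = [0.0] * len(L)
    for i in range(len(L)):
        x[i] = (b[i] - sum(L[i][k] * x[k] for k in range(i))) / L[i][i]
    return x


def transpose(M):
    return [list(row) for row in zip(*M)]


def to_standard(A, B):
    """Transformation stage: form C = L^{-1} A L^{-T} with B = L L^T."""
    L = cholesky(B)
    # X = L^{-1} A, computed one column of A at a time.
    X = transpose([forward_solve(L, col) for col in transpose(A)])
    # Row j of C is L^{-1} applied to row j of X, since C^T = L^{-1} X^T.
    return [forward_solve(L, row) for row in X]


def householder_tridiagonalize(C):
    """Reduction stage: return tridiagonal T = Q^T C Q for symmetric C."""
    n = len(C)
    T = [row[:] for row in C]
    for k in range(n - 2):
        x = [T[i][k] for i in range(k + 1, n)]
        norm_x = math.sqrt(sum(t * t for t in x))
        if norm_x == 0.0:
            continue  # column already has zeros below the subdiagonal
        # Reflector v maps x onto a multiple of e_1; the sign avoids cancellation.
        v = x[:]
        v[0] += math.copysign(norm_x, x[0])
        norm_v = math.sqrt(sum(t * t for t in v))
        v = [t / norm_v for t in v]
        m = len(v)
        # Apply H = I - 2 v v^T from the left to rows k+1..n-1.
        for j in range(n):
            dot = sum(v[i] * T[k + 1 + i][j] for i in range(m))
            for i in range(m):
                T[k + 1 + i][j] -= 2.0 * v[i] * dot
        # Apply H from the right to columns k+1..n-1.
        for i in range(n):
            dot = sum(T[i][k + 1 + j] * v[j] for j in range(m))
            for j in range(m):
                T[i][k + 1 + j] -= 2.0 * dot * v[j]
    return T
```

Calling `householder_tridiagonalize(to_standard(A, B))` chains the two stages. Because both are similarity transformations of the standard problem, T has the same eigenvalues as the original generalized problem. A GPU version would replace the inner loops with matrix-vector and matrix-matrix BLAS operations, which is where the data parallelism comes from.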