Introduction to parallel computing: design and analysis of algorithms
Introduction to parallel computing: design and analysis of algorithms
IBM Journal of Research and Development
Array decompositions for nonuniform computational environments
Journal of Parallel and Distributed Computing
ScaLAPACK user's guide
The grid: blueprint for a new computing infrastructure
The grid: blueprint for a new computing infrastructure
The grid
HPCN Europe '99 Proceedings of the 7th International Conference on High-Performance Computing and Networking
Matrix-Matrix Multiplication on Heterogeneous Platforms
ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
Adaptive parallel computing on heterogeneous networks with mpC
Parallel Computing
GREMLINS: a large sparse linear solver for grid environment
Parallel Computing
NPC'10 Proceedings of the 2010 IFIP international conference on Network and parallel computing
Hi-index | 0.00 |
In this paper, the authors deal with algorithmic issues on heterogeneous platforms. They concentrate on dense linear algebra kernels, such as matrix multiplication or LU decomposition. Block-cyclic distribution techniques used in ScaLAPACK are no longer sufficient to balance the load among processors running at different speeds. The main result of this paper is to provide a static data distribution scheme that leads to an asymptotically perfect load balancing for LU decomposition, thereby providing solid foundations toward the design of a cluster-oriented version of ScaLAPACK.