Parallel Out-of-Core Matrix Inversion

Authors:
Eddy Caron;Gil Utard
Affiliations:
-;-
Venue:
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Year:
2002

Citing 5
Cited 4

The design and implementation of SOLAR, a portable library for scalable out-of-core linear algebra computations

Proceedings of the fourth workshop on I/O in parallel and distributed systems: part of the federated computing research conference
ScaLAPACK user's guide

ScaLAPACK user's guide
Performance Prediction and Analysis of Parallel Out-Of-Core Matrix Factorization

HiPC '00 Proceedings of the 7th International Conference on High Performance Computing
Virtual Memory Management in Data Parallel Applications

HPCN Europe '99 Proceedings of the 7th International Conference on High-Performance Computing and Networking
POOCLAPACK: Parallel Out-of-Core Linear Algebra Package

POOCLAPACK: Parallel Out-of-Core Linear Algebra Package

On the performance of parallel factorization of out-of-core matrices

Parallel Computing
Efficiently using memory in lattice Boltzmann simulations

Future Generation Computer Systems - Special issue: Computational science of lattice Boltzmann modelling
Out-of-Core and Pipeline Techniques for Wavefront Algorithms

IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
A Fine-Grained Pipelined Implementation for Large-Scale Matrix Inversion on FPGA

APPT '09 Proceedings of the 8th International Symposium on Advanced Parallel Processing Technologies

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a parallel out-of-core algorithm to invert huge dense matrices, that is matrices larger than the available physical memory by one or more orders of magnitude. Preliminary performance results are shown for a commodity cluster. An accurate prediction performance model of the algorithm is given. Thanks to the prediction model, optimizations that avoid the overhead of the out-of-core algorithm are derived. Performance of the optimized algorithm using O(N) memory size are similar to the performance of the best known parallel in-core algorithm using O(N2) memory size (where N is the matrix order). There is no memory restriction for inversion of huge matrices!