VECPAR '00 Selected Papers and Invited Talks from the 4th International Conference on Vector and Parallel Processing
A Fine-Grained Pipelined Implementation for Large-Scale Matrix Inversion on FPGA
APPT '09 Proceedings of the 8th International Symposium on Advanced Parallel Processing Technologies
Hi-index | 0.00 |
We present a new parallel matrix inversion algorithm and report its implementation on parallel computers with distributed memory. The algorithm features natural load balance, simple programming and easy performance optimization, while maintaining the same arithmetic cost and numerical properties of the conventional inversion algorithm. Our analysis and experiments on a Cray T3E report near-peak performance for the new approach.