Nearest-neighbor mapping of finite element graphs onto processor meshes
IEEE Transactions on Computers
What have we learnt from using real parallel machines to solve real problems?
C3P Proceedings of the third conference on Hypercube concurrent computers and applications - Volume 2
Hi-index | 0.00 |
In this paper, parallel implementation and vectorization of the Scaled Conjugate Gradient (SCG) algorithm for the solution of large sparse linear system of equations, on a vector hypercube multiprocessor (iPSC- VX), is described. Computations in the SCG algorithm consist mainly of matrix operations that can be vectorized and are implemented on the Vector Processor on each node of the hypercube. The implementation described here achieves efficient parallelization by overlapping vectorized computations with inter-node communication. A speed-up of 58 over a µVax II is obtained for large finite element meshes.