Broadcast-Based Parallel LU Factorization

  • Authors:
  • Fernando G. Tinetti;Armando E. De Giusti

  • Affiliations:
  • III-LIDI, Facultad de informática, Universidad Nacional de La Plata, La Plata, Argentina;III-LIDI, Facultad de informática, Universidad Nacional de La Plata, La Plata, Argentina

  • Venue:
  • Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
  • Year:
  • 2005

Abstract

This paper presents a parallel LU factorization algorithm designed to take advantage of physical broadcast communication facilities and of overlapping communication with computation. Physical broadcast is directly available in Ethernet network hardware, one of the most widely used interconnection networks in clusters currently installed for parallel computing. Overlapping communication with computation is a well-known strategy for hiding communication latency, which is one of the most common sources of parallel performance penalties. A performance analysis of the proposed parallel LU factorization algorithm is presented, together with experimental results. The performance of the proposed algorithm is also compared with that of the algorithm used in ScaLAPACK (Scalable LAPACK), which is commonly accepted as having optimized performance.
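The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea of a broadcast-driven LU factorization, not the authors' method. It assumes a column-cyclic data distribution, no pivoting, and processes simulated sequentially inside one Python program; the function name `broadcast_lu` and the `broadcasts` counter are hypothetical. It does not model the overlapping of communication with computation.

```python
def broadcast_lu(a, nprocs):
    """Simulated column-cyclic LU without pivoting (illustrative assumption).

    Returns (L, U, broadcasts), where broadcasts counts the one-to-all
    messages a real implementation would send.
    """
    n = len(a)
    # Simulated process r owns every column j with j % nprocs == r.
    local = [
        {j: [a[i][j] for i in range(n)] for j in range(n) if j % nprocs == r}
        for r in range(nprocs)
    ]
    broadcasts = 0
    for k in range(n - 1):
        col = local[k % nprocs][k]  # column k lives on its owner process
        pivot = col[k]
        if pivot == 0:
            raise ZeroDivisionError("zero pivot; this sketch does no pivoting")
        # The owner computes the multipliers in place (they form column k of L).
        for i in range(k + 1, n):
            col[i] /= pivot
        # A single broadcast delivers the multipliers to all processes at once.
        multipliers = col[k + 1:]
        broadcasts += 1
        # Each process updates only the trailing columns it owns.
        for cols in local:
            for j, c in cols.items():
                if j > k:
                    for i in range(k + 1, n):
                        c[i] -= multipliers[i - k - 1] * c[k]
    # Gather the distributed columns back into one matrix for inspection.
    full = [[0.0] * n for _ in range(n)]
    for cols in local:
        for j, c in cols.items():
            for i in range(n):
                full[i][j] = c[i]
    lower = [[full[i][j] if j < i else float(i == j) for j in range(n)]
             for i in range(n)]
    upper = [[full[i][j] if j >= i else 0.0 for j in range(n)]
             for i in range(n)]
    return lower, upper, broadcasts
```

In this sketch each elimination step costs one broadcast regardless of the number of processes, which is the property that makes a hardware broadcast attractive; a blocked variant would broadcast a panel of columns per step instead of a single column.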