Efficient Adaptive Algorithms for Transposing Small and Large Matrices on Symmetric Multiprocessors

  • Authors:
  • Rami Al Na'mneh;W. David Pan;Seong-Moo Yoo

  • Affiliations:
  • Department of Electrical and Computer Engineering, University of Alabama in Huntsville, 301 Sparkman Drive, Huntsville, Alabama 35899, USA, e-mail: dwpan@ece.uah.edu;Department of Electrical and Computer Engineering, University of Alabama in Huntsville, 301 Sparkman Drive, Huntsville, Alabama 35899, USA, e-mail: dwpan@ece.uah.edu;Department of Electrical and Computer Engineering, University of Alabama in Huntsville, 301 Sparkman Drive, Huntsville, Alabama 35899, USA, e-mail: dwpan@ece.uah.edu

  • Venue:
  • Informatica
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Matrix transpose in parallel systems typically involves costly all-to-all communications. In this paper, we provide a comparative characterization of various efficient algorithms for transposing small and large matrices using the popular symmetric multiprocessors (SMP) architecture, which carries a relatively low communication cost due to its large aggregate bandwidth and low-latency inter-process communication. We conduct analysis on the cost of data sending / receiving and the memory requirement of these matrix-transpose algorithms. We then propose an adaptive algorithm that can minimize the overhead of the matrix transpose operations given the parameters such as the data size, number of processors, start-up time, and the effective communication bandwidth.