Efficient parallel out-of-core matrix transposition

  • Authors:
  • Sriram Krishnamoorthy;Gerald Baumgartner;Daniel Cociorva;Chi-Chung Lam;P. Sadayappan

  • Affiliations:
  • Department of Computer and Information Science, 395, Dreese Laboratories, 2015 Neil Avenue, The Ohio State University, Columbus, OH 43210, USA.;Department of Computer and Information Science, 395, Dreese Laboratories, 2015 Neil Avenue, The Ohio State University, Columbus, OH 43210, USA.;Department of Computer and Information Science, 395, Dreese Laboratories, 2015 Neil Avenue, The Ohio State University, Columbus, OH 43210, USA.;Department of Computer and Information Science, 395, Dreese Laboratories, 2015 Neil Avenue, The Ohio State University, Columbus, OH 43210, USA.;Department of Computer and Information Science, 395, Dreese Laboratories, 2015 Neil Avenue, The Ohio State University, Columbus, OH 43210, USA

  • Venue:
  • International Journal of High Performance Computing and Networking
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper addresses the problem of parallel transposition of large out-of-core arrays. Although algorithms for out-of-core matrix transposition have been widely studied, previously proposed algorithms have sought to minimise the number of I/O operations and the in-memory permutation time. We propose an algorithm that directly targets the improvement of overall transposition time. The I/O characteristics of the system are used to determine the read, write and communication block sizes such that the total execution time is minimised. We also provide a solution to the array redistribution problem for arrays on disk. The solutions to the sequential transposition problem and the parallel array redistribution problem are then combined to obtain an algorithm for the parallel out-of-core transposition problem.