A parallel implementation of Strassen's matrix multiplication algorithm for wormhole-routed all-port 2D torus networks

  • Authors:
  • Cesur Baransel; Kayhan M. İmre

  • Affiliations:
  • Saltus Yazılım Ltd., Hacettepe University Technopolis, Ankara, Turkey; Dept. of Computer Engineering, Hacettepe University, Ankara, Turkey

  • Venue:
  • The Journal of Supercomputing
  • Year:
  • 2012

Abstract

A new parallel implementation of Strassen's matrix multiplication algorithm is proposed for massively parallel supercomputers with 2D all-port torus interconnection networks. The proposed algorithm employs a special conflict-free routing pattern for better scalability and achieves performance very close to the theoretical bound for many practical network and matrix sizes. It scales effectively to very large networks, typically containing hundreds of thousands of processors, where petaflop or exaflop processing rates are sought.
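
For readers unfamiliar with the underlying method, the following is a minimal sequential sketch of the standard Strassen recursion, which replaces the eight block multiplications of the naive 2x2 block scheme with seven. It is not the parallel torus implementation described in the abstract and makes no attempt to model routing or processor layout. The function names (`add`, `sub`, `split`, `strassen`) are illustrative, and the sketch assumes square matrices whose size is a power of two.

```python
def add(A, B):
    """Element-wise sum of two equally sized matrices (lists of rows)."""
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]


def sub(A, B):
    """Element-wise difference of two equally sized matrices."""
    return [[a - b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]


def split(M):
    """Split a square matrix of even size into four quadrants."""
    h = len(M) // 2
    return (
        [row[:h] for row in M[:h]],  # top-left
        [row[h:] for row in M[:h]],  # top-right
        [row[:h] for row in M[h:]],  # bottom-left
        [row[h:] for row in M[h:]],  # bottom-right
    )


def strassen(A, B):
    """Multiply two n x n matrices, n a power of two (assumption of this sketch)."""
    n = len(A)
    if n == 1:
        return [[A[0][0] * B[0][0]]]

    a11, a12, a21, a22 = split(A)
    b11, b12, b21, b22 = split(B)

    # The seven recursive products of Strassen's scheme.
    m1 = strassen(add(a11, a22), add(b11, b22))
    m2 = strassen(add(a21, a22), b11)
    m3 = strassen(a11, sub(b12, b22))
    m4 = strassen(a22, sub(b21, b11))
    m5 = strassen(add(a11, a12), b22)
    m6 = strassen(sub(a21, a11), add(b11, b12))
    m7 = strassen(sub(a12, a22), add(b21, b22))

    # Recombine the products into the four quadrants of the result.
    c11 = add(sub(add(m1, m4), m5), m7)
    c12 = add(m3, m5)
    c21 = add(m2, m4)
    c22 = add(add(sub(m1, m2), m3), m6)

    top = [left + right for left, right in zip(c11, c12)]
    bottom = [left + right for left, right in zip(c21, c22)]
    return top + bottom
```

In a parallel setting the seven products at each level are independent of one another, which is what makes the recursion a natural candidate for distribution across processors; how the paper maps them onto the torus is not stated in the abstract.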