Neutron sensitivity and software hardening strategies for matrix multiplication and FFT on graphics processing units

  • Authors:
  • Paolo Rech; Laercio Pilla; Francesco Silvestri; Philippe Navaux; Luigi Carro

  • Affiliations:
  • UFRGS, Porto Alegre, Brazil; UFRGS, Porto Alegre, Brazil; Università di Padova, Padova, Italy; UFRGS, Porto Alegre, Brazil; UFRGS, Porto Alegre, Brazil

  • Venue:
  • Proceedings of the 3rd Workshop on Fault-tolerance for HPC at extreme scale
  • Year:
  • 2013

Abstract

In this paper, we compare the radiation response of GPUs executing matrix multiplication and FFT algorithms. The experimental results show that, for both algorithms, the output is affected by multiple errors in the majority of cases. Architectural and code analyses indicate that these multiple errors are caused by the corruption of shared resources or by thread dependencies. The experimental data and analytical studies can be used to estimate the expected error rate of GPUs in realistic applications and to design specific, optimized software-based hardening procedures.
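
As an illustration of what a software-based hardening procedure for matrix multiplication might look like, the sketch below uses row and column checksums, a well-known algorithm-based fault tolerance idea. The abstract does not say which technique the authors adopt, so this is an assumption chosen for illustration and not the paper's method. The function names `matmul` and `find_corrupted` and the tolerance `tol` are hypothetical, and plain Python stands in for GPU code.

```python
def matmul(a, b):
    """Plain product of two matrices given as lists of rows."""
    inner, cols = len(b), len(b[0])
    return [[sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for row in a]


def find_corrupted(a, b, c, tol=1e-9):
    """Return (bad_rows, bad_cols): rows and columns of c whose sums
    disagree with checksums derived from a and b alone.

    Illustrative assumption: checksum-based detection, not necessarily
    the hardening strategy used in the paper.
    """
    inner = len(b)
    # Expected row sums of C: A times the vector of row sums of B.
    row_sum_b = [sum(row) for row in b]
    expected_rows = [sum(row[k] * row_sum_b[k] for k in range(inner))
                     for row in a]
    # Expected column sums of C: the vector of column sums of A, times B.
    col_sum_a = [sum(row[k] for row in a) for k in range(inner)]
    expected_cols = [sum(col_sum_a[k] * b[k][j] for k in range(inner))
                     for j in range(len(b[0]))]
    bad_rows = [i for i, row in enumerate(c)
                if abs(sum(row) - expected_rows[i]) > tol]
    bad_cols = [j for j in range(len(c[0]))
                if abs(sum(row[j] for row in c) - expected_cols[j]) > tol]
    return bad_rows, bad_cols


if __name__ == "__main__":
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    c = matmul(a, b)
    c[0][0] += 1  # simulate two corrupted elements in the same row
    c[0][1] += 2
    print(find_corrupted(a, b, c))
```

A single corrupted element is flagged by exactly one row and one column, which pinpoints it. When several elements are corrupted, as the abstract reports for the majority of cases, the flagged rows and columns only outline the affected region, which is one reason a hardening scheme may need to account for the multiple-error patterns observed in the experiments.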