Optimizing sweep3d for graphic processor unit

  • Authors:
  • Chunye Gong;Jie Liu;Zhenghu Gong;Jin Qin;Jing Xie

  • Affiliations:
  • Department of Computer Sciences, National University of Defense Technology, Changsha, China;Department of Computer Sciences, National University of Defense Technology, Changsha, China;Department of Computer Sciences, National University of Defense Technology, Changsha, China;Department of Computer Sciences, National University of Defense Technology, Changsha, China;Department of Computer Sciences, National University of Defense Technology, Changsha, China

  • Venue:
  • ICA3PP'10 Proceedings of the 10th international conference on Algorithms and Architectures for Parallel Processing - Volume Part I
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

As a powerful and flexible processor, the Graphic Processing Unit (GPU) can offer great faculty in solving many high-performance computing applications Sweep3D, which simulates a single group time-independent discrete ordinates (Sn) neutron transport deterministically on 3D Cartesian geometry space, represents the key part of a real ASCI application The wavefront process for parallel computation in Sweep3D limits the concurrent threads on the GPU In this paper, we present multi-dimensional optimization methods for Sweep3D, which can be efficiently implemented on the fine grained parallel architecture of the GPU Our results show that the performance of overall Sweep3D on CPU-GPU hybrid platform can be improved up to 2.25 times as compared to the CPU-based implementation.