A Stream Processor Cluster Architecture Model with the Hybrid Technology of MPI and CUDA

  • Authors:
  • Qing-kui Chen;Jia-kang Zhang

  • Affiliations:
  • -;-

  • Venue:
  • ICISE '09 Proceedings of the 2009 First IEEE International Conference on Information Science and Engineering
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

Nowadays, the compute capability of traditional cluster system can't keep up with the computing needs of a practical application, and these aspects of energy, space technology, etc. have become a huge problem. However, as parallel computing equipment, the stream processor (SP) has a high performance of floating-point operations. NVIDIA GPUs is a typical stream processor device, CUDA technology enables the way to develop a better parallel program on GPUs to become flexible. In this paper, we make use of the hybrid parallel computing programming environment (HPCPE) with MPI and CUDA technology to build the simple CPU + GPU-based stream processor cluster system. In addition, we also proposed the "Two Level Model (TLM)" to separate the intensive computing tasks and controlling tasks, and exploit the compute capability of contemporary GPUs to accelerate computing tasks. Finally, we conducted a relevant experiment about the calculation of N-Body problem, and verified the better performance that stream processor cluster system has than the traditional one.