FT64: scientific computing with streams

  • Authors:
  • Mei Wen;Nan Wu;Chunyuan Zhang;Wei Wu;Qianming Yang;Changqing Xun

  • Affiliations:
  • National Laboratory for Parallel & Distributed Processing, National University of Defense Technology, Chang Sha, Hu Nan, P.R. of China;National Laboratory for Parallel & Distributed Processing, National University of Defense Technology, Chang Sha, Hu Nan, P.R. of China;National Laboratory for Parallel & Distributed Processing, National University of Defense Technology, Chang Sha, Hu Nan, P.R. of China;National Laboratory for Parallel & Distributed Processing, National University of Defense Technology, Chang Sha, Hu Nan, P.R. of China;National Laboratory for Parallel & Distributed Processing, National University of Defense Technology, Chang Sha, Hu Nan, P.R. of China;National Laboratory for Parallel & Distributed Processing, National University of Defense Technology, Chang Sha, Hu Nan, P.R. of China

  • Venue:
  • HiPC'07 Proceedings of the 14th international conference on High performance computing
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes FT64 and Multi-FT64, single- and multicoprocessor systems designed for high performance scientific computing with streams. We give a detailed case study of porting the Mersenne Prime Search problem to FT64 and Multi-FT64 systems. We discuss several special problems associated with streamizing, such as kernel processing granularity, stream organization and workload partitioning for a multi-processor, which are generally applicable to other scientific codes on FT64. Finally, we perform experiments with eight typical scientific applications on FT64. The results show that a 500MHz FT64 achieves over 50% of its peak performance and a 4.2x peak speedup over 1.6GHz Itanium2. An eight processor Multi-FT64 system achieves 6.8x peak speedup over a single FT64.