An OpenMP implementation of parallel FFT and its performance on IA-64 processors

  • Authors:
  • Daisuke Takahashi;Mitsuhisa Sato;Taisuke Boku

  • Affiliations:
  • Institute of Information Sciences and Electronics, University of Tsukuba, Tsukuba, Ibaraki, Japan;Institute of Information Sciences and Electronics, University of Tsukuba, Tsukuba, Ibaraki, Japan;Institute of Information Sciences and Electronics, University of Tsukuba, Tsukuba, Ibaraki, Japan

  • Venue:
  • WOMPAT'03 Proceedings of the OpenMP applications and tools 2003 international conference on OpenMP shared memory parallel programming
  • Year:
  • 2003

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper, we propose an OpenMP implementation of a recursive algorithm for parallel fast Fourier transform (FFT) on shared-memory parallel computers. A recursive three-step FFT algorithm improves performance by effectively utilizing the cache memory. Performance results of one-dimensional FFTs on the DELL PowerEdge 7150 and the hp workstation zx6000 are reported. We successfully achieved performance of about 757MFLOPS on the DELL PowerEdge 7150 (Itanium 800MHz, 4CPUs) and about 871MFLOPS on the hp workstation zx6000 (Itanium2 1GHz, 2CPUs) for 224-point FFT.