Implementing O(N) N-body algorithms efficiently in data-parallel languages

  • Authors:
  • Yu Hu;S. Lennart Johnsson

  • Affiliations:
  • -;-

  • Venue:
  • Scientific Programming
  • Year:
  • 1996

Quantified Score

Hi-index 0.01

Visualization

Abstract

The optimization techniques for hierarchical O(N) N-body algorithmsdescribed here focus on managing the data distribution and the datareferences, both between the memories of different nodes and withinthe memory hierarchy of each node. We show how the techniques canbe expressed in data-parallel languages, such as High PerformanceFortran (HPF) and Connection Machine Fortran (CMF). Theeffectiveness of our techniques is demonstrated on animplementation of Anderson's hierarchical O(N) N-body method forthe Connection Machine system CM-5/5E. Of the total execution time,communication accounts for about 10-20% of the total time, with theaverage efficiency for arithmetic operations being about 40% andthe total efficiency (including communication) being about 35%. Forthe CM-5E, a performance in excess of 60 Mflop/s per node (peak 160Mflop/s per node) has been measured.