Acceleration of Fast Multipole Method Using Special-Purpose Computer Grape

  • Authors:
  • Nguyen Hai Chau;Atsushi Kawai;Toshikazu Ebisuzaki

  • Affiliations:
  • College of Technology, Vietnam National University, 144 Xuan Thuy, Cau Giay, Hanoi, Vietnam;K&f Computing Research Co., 1-21-6-407, Kojima-Cho, Chofu, Tokyo, Japan 182-0026;Computational Astrophysics Laboratory, Institute of Physical and Chemical Research, (RIKEN), Hirosawa 2-1, Wako-shi, Saitama, Japan 351-0198

  • Venue:
  • International Journal of High Performance Computing Applications
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We have implemented the fast multipole method (FMM) on a special-purpose computer GRAPE (GRAvity piPE). The FMM is one of the fastest approximate algorithms to calculate forces among particles. Its calculation cost scales as O(N), while the naive algorithm scales as O(N2). Here, N is the number of particles in the system. GRAPE is hardware dedicated to the calculation of Coulombic or gravitational forces among particles. GRAPE's calculation speed is 100—1000 times faster than that of conventional computers of the same price, though it cannot handle anything but force calculation. We can expect significant speedup by the combination of the fast algorithm and the fast hardware. However, a straightforward implementation of the algorithm actually runs on GRAPE at rather modest speed. This is because of the limited functionality of the hardware. Since GRAPE can handle particle forces only, just a small fraction of the overall calculation procedure can be put on it. The remaining part must be performed on a conventional computer connected to GRAPE. In order to take full advantage of the dedicated hardware, we modified the FMM using the pseudoparticle multipole method and Anderson's method. In the modified algorithm, multipole and local expansions are expressed by distribution of a small number of imaginary particles (pseudoparticles), and thus they can be evaluated by GRAPE. Results of numerical experiments on ordinary GRAPE systems show that, for large-N systems (N ≥ 105), GRAPE accelerates the FMM by a factor ranging from 3 for low accuracy (RMS relative force error ~10—2) to 60 for high accuracy (RMS relative force error ~10— 5). Performance of the FMM on GRAPE exceeds that of Barnes—Hut treecode on GRAPE at high accuracy, in case of close-to-uniform distribution of particles. However, in the same experimental environment the treecode outperforms the FMM for inhomogeneous distribution of particles.