A fast algorithm for particle simulations
Journal of Computational Physics
Computer simulation using particles
Computer simulation using particles
A modified tree code: don't laugh; it runs
Journal of Computational Physics
Yet another fast multipole method without multipoles—pseudoparticle multipole method
Journal of Computational Physics
Scientific Simulations with Special Purpose Computers: The Grade Systems
Scientific Simulations with Special Purpose Computers: The Grade Systems
Anton, a special-purpose machine for molecular dynamics simulation
Proceedings of the 34th annual international symposium on Computer architecture
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
190 TFlops Astrophysical N-body Simulation on a Cluster of GPUs
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
A sparse octree gravitational N-body code that runs entirely on the GPU processor
Journal of Computational Physics
Hi-index | 0.00 |
In this paper, we describe the design and performance of GRAPE-8 accelerator processor for gravitational N-body simulations. It is designed to evaluate gravitational interaction with cutoff between particles. The cutoff function is useful for schemes like TreePM or Particle-Particle Particle-Tree, in which gravitational force is divided to short-range and long-range components. A single GRAPE-8 processor chip integrates 48 pipeline processors. The effective number of floating-point operations per interaction is around 40. Thus the peak performance of a single GRAPE-8 processor chip is 480 Gflops. A GRAPE-8 processor card houses two GRAPE-8 chips and one FPGA chip for PCI-Express interface. The total power consumption of the board is 46W. Thus, theoretical peak performance per watt is 20.5 Gflops/W. The effective performance of the total system, including the host computer, is around 5Gflops/W. This is more than a factor of two higher than the highest number in the current Green500 list.