A modified tree code: don't laugh; it runs
Journal of Computational Physics
Scientific Simulations with Special Purpose Computers: The Grade Systems
Scientific Simulations with Special Purpose Computers: The Grade Systems
Microwulf: a beowulf cluster for every desk
Proceedings of the 39th SIGCSE technical symposium on Computer science education
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
190 TFlops Astrophysical N-body Simulation on a Cluster of GPUs
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Scalable fast multipole methods on distributed heterogeneous architectures
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Hi-index | 0.00 |
As an entry for the 2006 Gordon Bell price/performance prize, we report an astrophysical N-body simulation performed with reconfigurable computing systems and a hierarchical tree algorithm. Each of our systems consists of a host PC and a reconfigurable add-in card attached via high-speed interface, PCI Express or PCI-X. The reconfigurable add-in card houses one FPGA chip, into which we integrated 16 pipeline processors specialized for gravitational force calculation. Other operations, such as tree construction, tree traverse and time integration, are performed on the host PC. On our system, we performed a cosmological N-body simulation with 2.1 million particles, which sustained a performance of 15.39 Gflops averaged over 4.33 hours. The price of our system is $2,384 in total and, price per performance obtained is $158/Gflops, which is 44 times better than that obtained by GRAPE-5 special-purpose hardware, the 1999 Gordon Bell winner.