High-resolution direct numerical simulations (DNSs) of incompressible turbulence with up to 4096^3 grid points have been executed on the Earth Simulator (ES). The DNSs are based on the Fourier spectral method, so the equation for mass conservation is solved accurately. In a DNS based on the spectral method, most of the computation time is spent on the three-dimensional (3D) fast Fourier transform (FFT), which requires large-scale global data transfer and has been the major obstacle to truly high-performance computing. By implementing new methods to perform the 3D FFT efficiently on the ES, we have achieved a DNS at 16.4 Tflops on 2048^3 grid points. The DNS yields an energy spectrum exhibiting a wide inertial subrange, in contrast to previous DNSs at lower resolutions, and therefore provides valuable data for the study of the universal features of turbulence at large Reynolds number.
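To illustrate why a distributed 3D FFT needs global data transfer, here is a minimal sketch of a slab-decomposed transform. It is not the method used on the ES, which the abstract does not describe. It assumes a common textbook layout in which each process owns a slab of z-planes, transforms along x and y locally, and then takes part in an all-to-all transpose before transforming along z. The function names (`dft`, `slab_fft3d`, `direct_dft3d`), the simulated "ranks", and the tiny grid size are all hypothetical. A naive O(n^2) DFT stands in for an optimized FFT kernel so that the example depends only on the standard library.

```python
import cmath

def dft(v):
    """Naive O(n^2) 1D DFT; a stand-in for an optimized FFT kernel."""
    n = len(v)
    return [sum(v[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
            for k in range(n)]

def slab_fft3d(a, n, p):
    """3D DFT of a[z][y][x] on an n^3 grid with p simulated ranks (p divides n)."""
    s = n // p  # planes per rank
    # Each simulated rank owns s consecutive z-planes with full x and y extent.
    slabs = [[[row[:] for row in a[z]] for z in range(r * s, (r + 1) * s)]
             for r in range(p)]
    # Step 1: transforms along x, then y, using only rank-local data.
    for slab in slabs:
        for plane in slab:
            for y in range(n):
                plane[y] = dft(plane[y])
            for x in range(n):
                col = dft([plane[y][x] for y in range(n)])
                for y in range(n):
                    plane[y][x] = col[y]
    # Step 2: global transpose. Every rank sends a block to every other rank,
    # so that rank q ends up with rows y = q*s .. (q+1)*s - 1 for ALL z.
    recv = [[[[0j] * n for _ in range(n)] for _ in range(s)] for _ in range(p)]
    messages = 0  # number of inter-rank block transfers
    for r in range(p):        # sending rank
        for q in range(p):    # receiving rank
            if r != q:
                messages += 1
            for zl in range(s):
                for yl in range(s):
                    for x in range(n):
                        recv[q][yl][x][r * s + zl] = slabs[r][zl][q * s + yl][x]
    # Step 3: transform along z, again using only rank-local data.
    out = [[[0j] * n for _ in range(n)] for _ in range(n)]  # out[kz][ky][kx]
    for q in range(p):
        for yl in range(s):
            for x in range(n):
                line = dft(recv[q][yl][x])
                for kz in range(n):
                    out[kz][q * s + yl][x] = line[kz]
    return out, messages

def direct_dft3d(a, n):
    """Reference 3D DFT straight from the definition (O(n^6), tiny grids only)."""
    def w(m):
        return cmath.exp(-2j * cmath.pi * m / n)
    return [[[sum(a[z][y][x] * w(kx * x + ky * y + kz * z)
                  for z in range(n) for y in range(n) for x in range(n))
              for kx in range(n)] for ky in range(n)] for kz in range(n)]

n, p = 4, 2  # 4^3 grid, 2 simulated ranks
a = [[[complex((x + 2 * y + 3 * z) % 5, (x * y - z) % 3) for x in range(n)]
      for y in range(n)] for z in range(n)]
out, messages = slab_fft3d(a, n, p)
ref = direct_dft3d(a, n)
max_err = max(abs(out[kz][ky][kx] - ref[kz][ky][kx])
              for kz in range(n) for ky in range(n) for kx in range(n))
print("max error vs direct DFT:", max_err)
print("inter-rank block transfers:", messages)
```

In this sketch only Step 2 moves data between ranks, and it involves every pair of ranks. Each rank sends all but 1/p of its slab, so nearly the entire n^3 array crosses the network on every 3D transform. This is the global data transfer that the abstract identifies as the bottleneck.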