QCDPAX-an MIMD array of vector processors for the numerical simulation of quantum chromodynamics
Proceedings of the 1989 ACM/IEEE conference on Supercomputing
The 1 Teraflops QCDSP computer
Parallel Computing - Special issue on high performance computing in lattice QCD
QCDOC: A 10 Teraflops Computer for Tightly-Coupled Calculations
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
The BlueGene/L supercomputer and quantum ChromoDynamics
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Overview of the Blue Gene/L system architecture
IBM Journal of Research and Development
Overview of the QCDSP and QCDOC computers
IBM Journal of Research and Development
The IBM Blue Gene/Q interconnection network and message unit
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Hi-index | 0.00 |
Lattice Quantum Chromodynamics (QCD) is one of the most challenging applications running on massively parallel supercomputers. To reproduce these physical phenomena on a supercomputer, a precise simulation is demanded requiring well optimized and scalable code. We have optimized lattice QCD programs on Blue Gene family supercomputers and shown the strength in lattice QCD simulation. Here we optimized on the third generation Blue Gene/Q supercomputer; i) by changing the data layout, ii) by exploiting new SIMD instruction sets, and iii) by pipelining boundary data exchange to overlap communication and calculation. The optimized lattice QCD program shows excellent weak scalability on the large scale Blue Gene/Q system, and with 16 racks we sustained 1.08 Pflop/s, 32.1% of the theoretical peak performance, including the conjugate gradient solver routines.