The Journal of Supercomputing
Performance Evaluation of the SGI Altix 3700
ICPP '05 Proceedings of the 2005 International Conference on Parallel Processing
An Application-Based Performance Characterization of the Columbia Supercluster
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
High Resolution Aerospace Applications using the NASA Columbia Supercomputer
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Performance characteristics of the multi-zone NAS parallel benchmarks
Journal of Parallel and Distributed Computing - Special issue: 18th International parallel and distributed processing symposium
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Application-based early performance evaluation of SGI altix 4700 systems for SGI systems
Proceedings of the 5th conference on Computing frontiers
Performance evaluation of supercomputers using HPCC and IMB Benchmarks
Journal of Computer and System Sciences
Early performance evaluation of a "Nehalem" cluster using scientific and engineering applications
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
Automatic generation of executable communication specifications from parallel applications
Proceedings of the international conference on Supercomputing
Scientific application based performance on magic cube
HPCA'09 Proceedings of the Second international conference on High Performance Computing and Applications
OpenMP parallelism for fluid and fluid-particulate systems
Parallel Computing
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Hi-index | 0.01 |
The suitability of next-generation high-performance computing systems for petascale simulations will depend on various performance factors attributable to processor, memory, local and global network, and input/output characteristics. In this paper, we evaluate performance of new dual-core SGI Altix 4700, quad-core SGI Altix ICE 8200, and dual-core IBM POWER5+ systems. To measure performance, we used micro-benchmarks from High Performance Computing Challenge (HPCC), NAS Parallel Benchmarks (NPB), and four real-world applications---three from computational fluid dynamics (CFD) and one from climate modeling. We used the micro-benchmarks to develop a controlled understanding of individual system components, then analyzed and interpreted performance of the NPBs and applications. We also explored the hybrid programming model (MPI+OpenMP) using multi-zone NPBs and the CFD application OVERFLOW-2. Achievable application performance is compared across the systems. For the ICE platform, we also investigated the effect of memory bandwidth on performance by testing 1, 2, 4, and 8 cores per node.