Comparing the Performance of Blue Gene/Q with Leading Cray XE6 and InfiniBand Systems
ICPADS '12 Proceedings of the 2012 IEEE 18th International Conference on Parallel and Distributed Systems
We present a performance analysis of three current architectures that have become commonplace in the High Performance Computing world. Blue Gene/Q is the third generation of IBM systems that use modestly performing cores at large scale to achieve high performance. The XE6 is the latest in a long line of Cray systems that use a 3-D topology, but the first to use the Gemini interconnection network. InfiniBand provides the flexibility of using compute nodes from many vendors, connected in many possible topologies. The performance characteristics of each vary widely, and the way in which nodes are allocated in each type of system can significantly affect achieved performance. In this work we compare these three systems using a combination of micro-benchmarks and a set of production applications. We also examine the differences in performance variability observed on each system and quantify the lost performance using both empirical measurements and performance models. Our results show that significant performance can be lost in normal production operation of the Cray XE6 and InfiniBand clusters in comparison to Blue Gene/Q.