Cray Cascade: a scalable HPC system based on a Dragonfly network

Authors:
Greg Faanes;Abdulla Bataineh;Duncan Roweth;Tom Court;Edwin Froese;Bob Alverson;Tim Johnson;Joe Kopnick;Mike Higgins;James Reinhard
Affiliations:
Cray Inc., Chippewa Falls, Wisconsin;Cray Inc., Chippewa Falls, Wisconsin;Cray Inc., Chippewa Falls, Wisconsin;Cray Inc., Chippewa Falls, Wisconsin;Cray Inc., Chippewa Falls, Wisconsin;Cray Inc., Chippewa Falls, Wisconsin;Cray Inc., Chippewa Falls, Wisconsin;Cray Inc., Chippewa Falls, Wisconsin;Cray Inc., Chippewa Falls, Wisconsin;Cray Inc., Chippewa Falls, Wisconsin
Venue:
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Year:
2012

Citing 9

Fat-trees: universal networks for hardware-efficient supercomputing
IEEE Transactions on Computers

The BlackWidow High-Radix Clos Network
Proceedings of the 33rd annual international symposium on Computer Architecture

SeaStar Interconnect: Balanced Bandwidth for Scalable Performance
IEEE Micro

The Cray BlackWidow: a highly scalable vector multiprocessor
Proceedings of the 2007 ACM/IEEE conference on Supercomputing

Technology-Driven, Highly-Scalable Dragonfly Topology
ISCA '08 Proceedings of the 35th Annual International Symposium on Computer Architecture

The PERCS High-Performance Interconnect
HOTI '10 Proceedings of the 2010 18th IEEE Symposium on High Performance Interconnects

The Gemini System Interconnect
HOTI '10 Proceedings of the 2010 18th IEEE Symposium on High Performance Interconnects

A uGNI-based MPICH2 nemesis network module for the Cray XE
EuroMPI'11 Proceedings of the 18th European MPI Users' Group conference on Recent advances in the message passing interface

The IBM Blue Gene/Q Interconnection Fabric
IEEE Micro

Cited 5

The Power 775 architecture at scale
Proceedings of the 27th ACM International Conference on Supercomputing

Kinetic turbulence simulations at extreme scale on leadership-class systems
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis

Enabling highly-scalable remote memory access programming with MPI-3 one sided
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis

Memory-centric system interconnect design with hybrid memory cubes
PACT '13 Proceedings of the 22nd international conference on Parallel architectures and compilation techniques

Performance implications of remote-only load balancing under adversarial traffic in Dragonflies
Proceedings of the 8th International Workshop on Interconnection Network Architecture: On-Chip, Multi-Chip

Abstract

Higher global bandwidth requirements for many applications and lower network cost have motivated the use of the Dragonfly network topology for high performance computing systems. In this paper we present the architecture of the Cray Cascade system, a distributed-memory system based on the Dragonfly [1] network topology. We describe the structure of the system, its Dragonfly network, and the routing algorithms. We also describe a set of advanced features supporting both mainstream high performance computing applications and emerging global address space programming models. We present a combination of performance results from prototype systems and simulation data for large systems. We demonstrate the value of the Dragonfly topology and the benefits obtained through extensive use of adaptive routing.