DASH-IO: an empirical study of flash-based IO for HPC
Proceedings of the 2010 TeraGrid Conference
DASH: a Recipe for a Flash-based Data Intensive Supercomputer
Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis
Subset removal on massive data with Dash
Proceedings of the 2011 TeraGrid Conference: Extreme Digital Discovery
Evaluation of I/O technologies on a flash-based I/O sub-system for HPC
Proceedings of the 1st Workshop on Architectures and Systems for Big Data
Exploring Twitter networks in parallel computing environments
Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery
Hadoop deployment and performance on Gordon data intensive supercomputer
Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery
Using Gordon to accelerate LHC science
Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery
Enabling fair pricing on HPC systems with node sharing
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Exploring the future of out-of-core computing with compute-local non-volatile memory
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Hi-index | 0.00 |
The Gordon data intensive supercomputer entered service in 2012 as an allocable computing system in the NSF Extreme Science and Engineering Discovery Environment (XSEDE) program. Gordon has several innovative features that make it ideal for data intensive computing including: 1,024, compute nodes based on Intel's Sandy Bridge (Xeon E5) processor; 64 I/O nodes with an aggregate of 300 TB of high performance flash (SSD); large, virtual SMP "supernodes" of up to 2 TB DRAM; a dual-rail, QDR InfiniBand, 3D torus network based on commodity hardware and open source software; and a 100 GB/s Lustre based parallel file system, with over 4 PB of disk space. In this paper we present the motivation, design, and performance of Gordon. We provide: low level micro-benchmark results to demonstrate processor, memory, I/O, and network performance; standard HPC benchmarks; and performance on data intensive applications to demonstrate Gordon's performance on typical workloads. We highlight the inherent risks in, and describe mitigation strategies for, deploying a data intensive supercomputer like Gordon which embodies significant innovative technologies. Finally we present our experiences thus far in supporting users and managing Gordon.