An Evaluation of Current High-Performance Networks

Authors:
Christian Bell;Dan Bonachea;Yannick Cote;Jason Duell;Paul Hargrove;Parry Husbands;Costin Iancu;Michael Welcome;Katherine Yelick
Affiliations:
-;-;-;-;-;-;-;-;-
Venue:
IPDPS '03 Proceedings of the 17th International Symposium on Parallel and Distributed Processing
Year:
2003

Citing 0
Cited 28

A performance analysis of the Berkeley UPC compiler

ICS '03 Proceedings of the 17th annual international conference on Supercomputing
Performance Comparison of MPI Implementations over InfiniBand, Myrinet and Quadrics

Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Evaluating InfiniBand Performance with PCI Express

IEEE Micro
Performance Analysis of MPI Collective Operations

IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 15 - Volume 16
Performance Modeling and Tuning Strategies of Mixed Mode Collective Communications

SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
High Performance Remote Memory Access Communication: The Armci Approach

International Journal of High Performance Computing Applications
Self-adapting numerical software (SANS) effort

IBM Journal of Research and Development
Quantifying the potential benefit of overlapping communication and computation in large-scale scientific applications

Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Performance evaluation of high-speed interconnects using dense communication patterns

Parallel Computing
Optimizing communication overlap for high-speed networks

Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming
Performance analysis of MPI collective operations

Cluster Computing
Automatic nonblocking communication for partitioned global address space programs

Proceedings of the 21st annual international conference on Supercomputing
Optimizing a conjugate gradient solver with non-blocking collective operations

Parallel Computing
Problems with using MPI 1.1 and 2.0 as compilation targets for parallel language implementations

International Journal of High Performance Computing and Networking
Optimisation and performance evaluation of mechanisms for latency tolerance in remote memory access communication on clusters

International Journal of High Performance Computing and Networking
Evaluating high performance communication: a power perspective

Proceedings of the 23rd international conference on Supercomputing
Scalable communication protocols for dynamic sparse data exchange

Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
Design and evaluation of nonblocking collective I/O operations

EuroMPI'11 Proceedings of the 18th European MPI Users' Group conference on Recent advances in the message passing interface
Measuring MPI send and receive overhead and application availability in high performance network interfaces

EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface
Challenges and issues in benchmarking MPI

EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface
Message strip-mining heuristics for high speed networks

VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Adapting distributed scientific applications to run-time network conditions

PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
Architecture and early performance of the new IBM HPS fabric and adapter

HiPC'04 Proceedings of the 11th international conference on High Performance Computing
Parallel short range molecular dynamics simulations on computer clusters: Performance evaluation and modeling

Mathematical and Computer Modelling: An International Journal
Communication avoiding and overlapping for numerical linear algebra

SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Netgauge: a network performance measurement framework

HPCC'07 Proceedings of the Third international conference on High Performance Computing and Communications
Enabling highly-scalable remote memory access programming with MPI-3 one sided

SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Performance modelling of parallel BLAST using Intel and PGI compilers on an infiniband-based HPC cluster

International Journal of Bioinformatics Research and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

High-end supercomputers are increasingly built out of commodity components, and lack tight integration between the processor and network. This often results in inefficiencies in the communication subsystem, such as high software overheads and/or message latencies. In this paper we use a set of microbenchmarks to quantify the cost of this commoditization,measuring software overhead, latency, and bandwidth on five contemporary supercomputing networks. We compare the performance of the ubiquitous MPI layer to that of lower-level communication layers, and quantify the advantages of the latter for small message performance. We also provide data on the potential for various communication-related optimizations, such as overlapping communication with computation or other communication. Finally, we determine the minimum size needed for a message to be considered large' (i.e., bandwidth-bound) on these platforms, and provide historical data on the software overheads of a number of supercomputers over the past decade.