Two recently delivered systems have begun a new trend in cluster interconnects. Both the InfiniPath network from PathScale, Inc., and the RapidArray fabric in the XD1 system from Cray, Inc., leverage commodity network fabrics while customizing the network interface in an attempt to add value specifically for the high-performance computing (HPC) cluster market. Both network interfaces are compatible with standard InfiniBand (IB) switches, but neither uses the traditional programming interfaces to support MPI. Another fundamental difference between these networks and other modern network adapters is that much of the network protocol processing is performed on the host processor(s) rather than on the network interface itself. This approach stands in stark contrast to the current direction of most high-performance networking efforts, which offload as much protocol processing as possible to the network interface. In this paper, we provide an initial performance comparison of the two partially custom networks (PathScale's InfiniPath and Cray's XD1) with a more commodity network (standard IB) and a more custom network (Quadrics Elan4). Our evaluation includes several micro-benchmark results as well as some initial application performance data.
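To make the micro-benchmark methodology concrete, the sketch below shows a minimal MPI ping-pong latency test of the kind commonly used to compare interconnects such as these. It is illustrative only, not the authors' actual benchmark; the iteration counts and message size are assumed values.

```c
#include <mpi.h>
#include <stdio.h>
#include <string.h>

/* Minimal MPI ping-pong latency micro-benchmark (illustrative sketch).
 * Rank 0 sends a small message to rank 1, which echoes it back; half
 * the averaged round-trip time approximates the one-way latency. */
int main(int argc, char **argv)
{
    const int warmup   = 1000;   /* untimed warm-up iterations (assumed) */
    const int iters    = 10000;  /* timed iterations (assumed) */
    const int msg_size = 8;      /* message size in bytes (assumed) */
    char buf[64];
    int rank;
    double t0 = 0.0;

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    memset(buf, 0, sizeof(buf));

    for (int i = 0; i < warmup + iters; i++) {
        if (i == warmup) {
            /* Synchronize before starting the timed phase. */
            MPI_Barrier(MPI_COMM_WORLD);
            t0 = MPI_Wtime();
        }
        if (rank == 0) {
            MPI_Send(buf, msg_size, MPI_CHAR, 1, 0, MPI_COMM_WORLD);
            MPI_Recv(buf, msg_size, MPI_CHAR, 1, 0, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);
        } else if (rank == 1) {
            MPI_Recv(buf, msg_size, MPI_CHAR, 0, 0, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);
            MPI_Send(buf, msg_size, MPI_CHAR, 0, 0, MPI_COMM_WORLD);
        }
    }

    if (rank == 0) {
        double usec = (MPI_Wtime() - t0) * 1e6 / iters / 2.0;
        printf("one-way latency: %.2f us for %d-byte messages\n",
               usec, msg_size);
    }
    MPI_Finalize();
    return 0;
}
```

Small-message latency of this sort is particularly sensitive to where protocol processing happens, which is why it is a natural first measurement when comparing host-based (onload) designs such as InfiniPath against offload-oriented adapters.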