Congestion avoidance and control
SIGCOMM '88 Symposium proceedings on Communications architectures and protocols
Stream control transmission protocol (SCTP): a reference guide
Stream control transmission protocol (SCTP): a reference guide
Using Multirail Networks in High-Performance Clusters
CLUSTER '01 Proceedings of the 3rd IEEE International Conference on Cluster Computing
UNIX Network Programming, Vol. 1
UNIX Network Programming, Vol. 1
Building Multirail InfiniBand Clusters: MPI-Level Design and Performance Evaluation
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
End-to-end fault tolerance using transport layer multihoming
End-to-end fault tolerance using transport layer multihoming
Self-Configuring Communication Middleware Model for Multiple Network Interfaces
COMPSAC '05 Proceedings of the 29th Annual International Computer Software and Applications Conference - Volume 01
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
ISPASS '03 Proceedings of the 2003 IEEE International Symposium on Performance Analysis of Systems and Software
High-performance message striping over reliable transport protocols
The Journal of Supercomputing
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
End-to-end concurrent multipath transfer using transport layer multihoming
End-to-end concurrent multipath transfer using transport layer multihoming
Concurrent multipath transfer using SCTP multihoming over independent end-to-end paths
IEEE/ACM Transactions on Networking (TON)
Performance implications of a bounded receive buffer in concurrent multipath transfer
Computer Communications
RI2N/DRV: Multi-link ethernet for high-bandwidth and fault-tolerant network on PC clusters
IPDPS '09 Proceedings of the 2009 IEEE International Symposium on Parallel&Distributed Processing
NETWORKING'08 Proceedings of the 7th international IFIP-TC6 networking conference on AdHoc and sensor networks, wireless networks, next generation internet
Concurrent multipath transfer using transport layer multihoming: performance under network failures
MILCOM'06 Proceedings of the 2006 IEEE conference on Military communications
An analysis of TCP processing overhead
IEEE Communications Magazine
End system optimizations for high-speed TCP
IEEE Communications Magazine
Using CMT in SCTP-based MPI to exploit multiple interfaces in cluster nodes
PVM/MPI'07 Proceedings of the 14th European conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Hi-index | 0.00 |
Building clusters from commodity off-the-shelf parts is a well-established technique for building inexpensive medium- to large-size computing clusters. Many commodity mid-range motherboards come with multiple Gigabit Ethernet interfaces, and the low cost per port for Gigabit Ethernet makes switches inexpensive as well. Our objective in this work is to take advantage of multiple inexpensive Gigabit network cards and Ethernet switches to enhance the communication and reliability performance of a cluster. Unlike previous approaches that take advantage of multiple network connections for multi-railing, we consider CMT (Concurrent Multipath Transfer) that extends SCTP (Stream Control Transmission Protocol), a transport protocol developed by the IETF, to make use of the multiple paths that exist between two hosts. In this work, we explore the applicability of CMT in the transport layer of the network stack to high-performance computing environments. We develop SCTP-based MPI (Message Passing Interface) middleware for MPICH2 and Open MPI, and evaluate the reliability and communication performance of the system. Using Open MPI with support for message striping over multiple paths at the middleware level, we compare the differences in supporting multi-railing in the middleware versus at the transport layer.