MPI and Embedded TCP/IP Gigabit Ethernet Cluster Computing
LCN '02 Proceedings of the 27th Annual IEEE Conference on Local Computer Networks
Performance Characterization of a 10-Gigabit Ethernet TOE
HOTI '05 Proceedings of the 13th Symposium on High Performance Interconnects
Wide-area performance profiling of 10GigE and InfiniBand technologies
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Gravel: A Communication Library to Fast Path MPI
Proceedings of the 15th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Minimizing the Hidden Cost of RDMA
ICDCS '09 Proceedings of the 2009 29th IEEE International Conference on Distributed Computing Systems
Remote Direct Memory Access over the Converged Enhanced Ethernet Fabric: Evaluating the Options
HOTI '09 Proceedings of the 2009 17th IEEE Symposium on High Performance Interconnects
Designing Efficient FTP Mechanisms for High Performance Data-Transfer over InfiniBand
ICPP '09 Proceedings of the 2009 International Conference on Parallel Processing
High Performance Data Transfer in Grid Environment Using GridFTP over InfiniBand
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
ICPPW '10 Proceedings of the 2010 39th International Conference on Parallel Processing Workshops
Middleware Support for RDMA-based Data Transfer in Cloud Computing
IPDPSW '12 Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum
Journal of Systems and Software
Design and performance evaluation of NUMA-aware RDMA-based end-to-end data transfer systems
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Efficient wide area data transfer protocols for 100 Gbps networks and beyond
NDM '13 Proceedings of the Third International Workshop on Network-Aware Data Management
Hi-index | 0.00 |
Providing high-speed data transfer is vital to various data-intensive applications. While there have been remarkable technology advances to provide ultra-high-speed network bandwidth, existing protocols and applications may not be able to fully utilize the bare-metal bandwidth due to their inefficient design. We identify the same problem remains in the field of Remote Direct Memory Access (RDMA) networks. RDMA offloads TCP/IP protocols to hardware devices. However, its benefits have not been fully exploited due to the lack of efficient software and application protocols, in particular in wide-area networks. In this paper, we address the design choices to develop such protocols. We describe a protocol implemented as part of a communication middleware. The protocol has its flow control, connection management, and task synchronization. It maximizes the parallelism of RDMA operations. We demonstrate its performance benefit on various local and wide-area testbeds, including the DOE ANI testbed with RoCE links and InfiniBand links.