Efficient data transfer protocols for big data

  • Authors:
  • Ezra Kissel;Martin Swany;Eric Pouyoul;Brian Tierney

  • Affiliations:
  • School of Informatics and Computing, Indiana University, Bloomington, IN 47405;School of Informatics and Computing, Indiana University, Bloomington, IN 47405;Lawrence Berkeley National Laboratory, Berkeley, CA 94270;Lawrence Berkeley National Laboratory, Berkeley, CA 94270

  • Venue:
  • E-SCIENCE '12 Proceedings of the 2012 IEEE 8th International Conference on E-Science (e-Science)
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data set sizes are growing exponentially, so it is important to use data movement protocols that are the most efficient available. Most data movement tools today rely on TCP over sockets, which limits flows to around 20Gbps on today's hardware. RDMA over Converged Ethernet (RoCE) is a promising new technology for high-performance network data movement with minimal CPU impact over circuit-based infrastructures. We compare the performance of TCP, UDP, UDT, and RoCE over high latency 10Gbps and 40Gbps network paths, and show that RoCE-based data transfers can fill a 40Gbps path using much less CPU than other protocols. We also show that the Linux zero-copy system calls can improve TCP performance considerably, especially on current Intel “Sandy Bridge”-based PCI Express 3.0 (Gen3) hosts.