Minimizing the Data Transfer Time Using Multicore End-System Aware Flow Bifurcation

Authors:
Vishal Ahuja;Dipak Ghosal;Matthew Farrens
Affiliations:
-;-;-
Venue:
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Year:
2012

Citing 23
Cited 1

End-to-end packet delay and loss behavior in the internet

SIGCOMM '93 Conference proceedings on Communications architectures, protocols and applications
The performance of TCP/IP for networks with high bandwidth-delay products and random loss

IEEE/ACM Transactions on Networking (TON)
PSockets: the case for application-level network striping for data intensive applications using high speed wide area networks

Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Congestion control for high bandwidth-delay product networks

Proceedings of the 2002 conference on Applications, technologies, architectures, and protocols for computer communications
Dynamic Right-Sizing: An Automated, Lightweight, and Scalable Technique for Enhancing Grid Performance

PIHSN '02 Proceedings of the 7th IFIP/IEEE International Workshop on Protocols for High Speed Networks
Reliable Blast UDP: Predictable High Performance Bulk Data Transfer

CLUSTER '02 Proceedings of the IEEE International Conference on Cluster Computing
Analysis of long duration traces

ACM SIGCOMM Computer Communication Review
Dynamic load balancing without packet reordering

ACM SIGCOMM Computer Communication Review
A machine learning approach to TCP throughput prediction

Proceedings of the 2007 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Passive measurement of one-way and two-way flow lifetimes

ACM SIGCOMM Computer Communication Review
Adaptive data block scheduling for parallel TCP streams

HPDC '05 Proceedings of the High Performance Distributed Computing, 2005. HPDC-14. Proceedings. 14th IEEE International Symposium
Dynamically tuning level of parallelism in wide area data transfers

DADC '08 Proceedings of the 2008 international workshop on Data-aware distributed computing
Maelstrom: transparent error correction for lambda networks

NSDI'08 Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation
The PARSEC benchmark suite: characterization and architectural implications

Proceedings of the 17th international conference on Parallel architectures and compilation techniques
RouteBricks: exploiting parallelism to scale software routers

Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles
Reliable communication for datacenters

Reliable communication for datacenters
Data center TCP (DCTCP)

Proceedings of the ACM SIGCOMM 2010 conference
A preliminary analysis of TCP performance in an enterprise network

INM/WREN'10 Proceedings of the 2010 internet network management conference on Research on enterprise networking
Exact temporal characterization of 10 Gbps optical wide-area network

IMC '10 Proceedings of the 10th ACM SIGCOMM conference on Internet measurement
Introspective end-system modeling to optimize the transfer time of rate based protocols

Proceedings of the 20th international symposium on High performance distributed computing
The Future of Computing Performance: Game Over or Next Level?

The Future of Computing Performance: Game Over or Next Level?
Analysis of packet loss processes in high-speed networks

IEEE Transactions on Information Theory
Operating systems abstractions for software packet processing in datacenters

Operating systems abstractions for software packet processing in datacenters

Cache-aware affinitization on commodity multicores for high-speed network flows

Proceedings of the eighth ACM/IEEE symposium on Architectures for networking and communications systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data centers are being deployed in a wide variety of environments (cloud computing, scientific, financial, defense, etc.). When geographically distributed, these data centers must transmit and receive growing volumes of data. In order to avoid congestion in the public internet, most use high speed dedicated optical networks, which can be thought of as private highways for carrying data. In this work, we examined the impact of such high speed network traffic on a commodity multicore machine, and identified a number of scenarios that cause packet loss and degraded throughput due to an end-system inability to consume incoming data fast enough. We show that high speed single flow traffic nullifies the benefits of multicore systems and multiqueue NICs, and we propose an end-system aware flow bifurcation technique to optimize the data transfer time using rate based protocols. Using introspective end-system modeling, we determine the optimal number of parallel flows required to utilize the available bandwidth, and the optimal rate for each of the flows. We compare our approach with GridFTP, which is a widely used data transfer protocol in computational grids, and show that our approach performs better (particularly when the end-system losses are in the receive ring buffer.)