End-to-end packet delay and loss behavior in the internet
SIGCOMM '93 Conference proceedings on Communications architectures, protocols and applications
The performance of TCP/IP for networks with high bandwidth-delay products and random loss
IEEE/ACM Transactions on Networking (TON)
Proceedings of the 2000 ACM/IEEE conference on Supercomputing
Congestion control for high bandwidth-delay product networks
Proceedings of the 2002 conference on Applications, technologies, architectures, and protocols for computer communications
PIHSN '02 Proceedings of the 7th IFIP/IEEE International Workshop on Protocols for High Speed Networks
Reliable Blast UDP: Predictable High Performance Bulk Data Transfer
CLUSTER '02 Proceedings of the IEEE International Conference on Cluster Computing
Analysis of long duration traces
ACM SIGCOMM Computer Communication Review
Dynamic load balancing without packet reordering
ACM SIGCOMM Computer Communication Review
A machine learning approach to TCP throughput prediction
Proceedings of the 2007 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Passive measurement of one-way and two-way flow lifetimes
ACM SIGCOMM Computer Communication Review
Adaptive data block scheduling for parallel TCP streams
HPDC '05 Proceedings of the High Performance Distributed Computing, 2005. HPDC-14. Proceedings. 14th IEEE International Symposium
Dynamically tuning level of parallelism in wide area data transfers
DADC '08 Proceedings of the 2008 international workshop on Data-aware distributed computing
Maelstrom: transparent error correction for lambda networks
NSDI'08 Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation
The PARSEC benchmark suite: characterization and architectural implications
Proceedings of the 17th international conference on Parallel architectures and compilation techniques
RouteBricks: exploiting parallelism to scale software routers
Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles
Reliable communication for datacenters
Reliable communication for datacenters
Proceedings of the ACM SIGCOMM 2010 conference
A preliminary analysis of TCP performance in an enterprise network
INM/WREN'10 Proceedings of the 2010 internet network management conference on Research on enterprise networking
Exact temporal characterization of 10 Gbps optical wide-area network
IMC '10 Proceedings of the 10th ACM SIGCOMM conference on Internet measurement
Introspective end-system modeling to optimize the transfer time of rate based protocols
Proceedings of the 20th international symposium on High performance distributed computing
The Future of Computing Performance: Game Over or Next Level?
The Future of Computing Performance: Game Over or Next Level?
Analysis of packet loss processes in high-speed networks
IEEE Transactions on Information Theory
Operating systems abstractions for software packet processing in datacenters
Operating systems abstractions for software packet processing in datacenters
Cache-aware affinitization on commodity multicores for high-speed network flows
Proceedings of the eighth ACM/IEEE symposium on Architectures for networking and communications systems
Hi-index | 0.00 |
Data centers are being deployed in a wide variety of environments (cloud computing, scientific, financial, defense, etc.). When geographically distributed, these data centers must transmit and receive growing volumes of data. In order to avoid congestion in the public internet, most use high speed dedicated optical networks, which can be thought of as private highways for carrying data. In this work, we examined the impact of such high speed network traffic on a commodity multicore machine, and identified a number of scenarios that cause packet loss and degraded throughput due to an end-system inability to consume incoming data fast enough. We show that high speed single flow traffic nullifies the benefits of multicore systems and multiqueue NICs, and we propose an end-system aware flow bifurcation technique to optimize the data transfer time using rate based protocols. Using introspective end-system modeling, we determine the optimal number of parallel flows required to utilize the available bandwidth, and the optimal rate for each of the flows. We compare our approach with GridFTP, which is a widely used data transfer protocol in computational grids, and show that our approach performs better (particularly when the end-system losses are in the receive ring buffer.)