Less is more: trading a little bandwidth for ultra-low latency in the data center

Authors:
Mohammad Alizadeh;Abdul Kabbani;Tom Edsall;Balaji Prabhakar;Amin Vahdat;Masato Yasuda
Affiliations:
Stanford University;Google;Cisco Systems;Stanford University;Google and U.C. San Diego;NEC Corporation, Japan
Venue:
NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
Year:
2012

Abstract

Traditional measures of network goodness (goodput, quality of service, fairness) are expressed in terms of bandwidth. Network latency has rarely been a primary concern because delivering the highest level of bandwidth essentially entails driving up latency, at the mean and, especially, at the tail. Recently, however, there has been renewed interest in latency as a primary metric for mainstream applications. In this paper, we present the HULL (High-bandwidth Ultra-Low Latency) architecture to balance two seemingly contradictory goals: near-baseline fabric latency and high bandwidth utilization. HULL leaves 'bandwidth headroom' using Phantom Queues that deliver congestion signals before network links are fully utilized and queues form at switches. By capping utilization at less than link capacity, we leave room for latency-sensitive traffic to avoid buffering and the associated large delays. At the same time, we use DCTCP, a recently proposed congestion control algorithm, to adaptively respond to congestion and to mitigate the bandwidth penalties that arise from operating in a bufferless fashion. HULL further employs packet pacing to counter burstiness caused by Interrupt Coalescing and Large Send Offloading. Our implementation and simulation results show that by sacrificing a small amount (e.g., 10%) of bandwidth, HULL can dramatically reduce average and tail latencies in the data center.
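
The Phantom Queue idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a phantom queue is a byte counter that drains at a fraction of the link rate and signals congestion once the counter passes a threshold. The names `PhantomQueue`, `drain_fraction`, `mark_threshold_bytes`, and `marked_fraction` are hypothetical, the 0.9 drain fraction simply mirrors the abstract's "e.g., 10%" headroom, and the 3000-byte threshold is an arbitrary illustrative value.

```python
from dataclasses import dataclass


@dataclass
class PhantomQueue:
    """Virtual byte counter that drains slower than the real link (illustrative)."""
    link_rate_bps: float         # physical link capacity C, in bits per second
    drain_fraction: float        # assumed gamma < 1; the counter drains at gamma * C
    mark_threshold_bytes: float  # assumed threshold above which packets are marked
    backlog_bytes: float = 0.0
    last_time_s: float = 0.0

    def on_packet(self, now_s: float, size_bytes: int) -> bool:
        """Account for one packet sent at time now_s; return True to signal congestion."""
        elapsed = now_s - self.last_time_s
        drained = elapsed * self.drain_fraction * self.link_rate_bps / 8.0
        self.backlog_bytes = max(0.0, self.backlog_bytes - drained)
        self.last_time_s = now_s
        self.backlog_bytes += size_bytes
        return self.backlog_bytes > self.mark_threshold_bytes


def marked_fraction(load: float, packets: int = 10000) -> float:
    """Send evenly spaced 1500-byte packets at load * C; return the fraction marked."""
    rate = 10e9  # assumed 10 Gb/s link
    pq = PhantomQueue(link_rate_bps=rate, drain_fraction=0.9,
                      mark_threshold_bytes=3000.0)
    gap = 1500 * 8 / (load * rate)  # inter-packet spacing in seconds
    marks = sum(pq.on_packet(i * gap, 1500) for i in range(1, packets + 1))
    return marks / packets


if __name__ == "__main__":
    for load in (0.8, 0.95):
        print(f"load={load:.2f} marked={marked_fraction(load):.3f}")
```

In this open-loop sketch, traffic offered below 90% of link capacity is never marked, while traffic above it is marked almost continuously even though the real link is not yet saturated. In the architecture described above, a congestion controller such as DCTCP would react to those signals and reduce its rate, which the sketch does not model.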