Small is better: avoiding latency traps in virtualized data centers

Authors:
Yunjing Xu;Michael Bailey;Brian Noble;Farnam Jahanian
Affiliations:
University of Michigan;University of Michigan;University of Michigan;University of Michigan
Venue:
Proceedings of the 4th annual Symposium on Cloud Computing
Year:
2013

Citing 42
Cited 0

Size-based scheduling to improve web performance

ACM Transactions on Computer Systems (TOCS)
When Virtual Is Better Than Real

HOTOS '01 Proceedings of the Eighth Workshop on Hot Topics in Operating Systems
The War between Mice and Elephants

ICNP '01 Proceedings of the Ninth International Conference on Network Protocols
Xen and the art of virtualization

SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Sizing router buffers

Proceedings of the 2004 conference on Applications, technologies, architectures, and protocols for computer communications
Latency lags bandwith

Communications of the ACM - Voting systems
VSched: Mixing Batch And Interactive Virtual Machines Using Periodic Real-time Scheduling

SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
SPEC CPU2006 benchmark descriptions

ACM SIGARCH Computer Architecture News
Xen and co.: communication-aware CPU scheduling for consolidated xen-based hosting platforms

Proceedings of the 3rd international conference on Virtual execution environments
Dynamo: amazon's highly available key-value store

Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
A scalable, commodity data center network architecture

Proceedings of the ACM SIGCOMM 2008 conference on Data communication
Task-aware virtual machine scheduling for I/O performance.

Proceedings of the 2009 ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
Hey, you, get off of my cloud: exploring information leakage in third-party compute clouds

Proceedings of the 16th ACM conference on Computer and communications security
Empirical evaluation of latency-sensitive application performance in the cloud

MMSys '10 Proceedings of the first annual ACM SIGMM conference on Multimedia systems
Supporting soft real-time tasks in the xen hypervisor

Proceedings of the 6th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
The impact of virtualization on network performance of amazon EC2 data center

INFOCOM'10 Proceedings of the 29th conference on Information communications
Data center TCP (DCTCP)

Proceedings of the ACM SIGCOMM 2010 conference
I/O scheduling model of virtual machine based on multi-core dynamic partitioning

Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
CloudCmp: comparing public cloud providers

IMC '10 Proceedings of the 10th ACM SIGCOMM conference on Internet measurement
Network traffic characteristics of data centers in the wild

IMC '10 Proceedings of the 10th ACM SIGCOMM conference on Internet measurement
Runtime measurements in the cloud: observing, analyzing, and reducing variance

Proceedings of the VLDB Endowment
An analysis of Linux scalability to many cores

OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Sharing the data center network

Proceedings of the 8th USENIX conference on Networked systems design and implementation
It's time for low latency

HotOS'13 Proceedings of the 13th USENIX conference on Hot topics in operating systems
Better never than late: meeting deadlines in datacenter networks

Proceedings of the ACM SIGCOMM 2011 conference
RT-Xen: towards real-time hypervisor scheduling in xen

EMSOFT '11 Proceedings of the ninth ACM international conference on Embedded software
An exploration of L2 cache covert channels in virtualized environments

Proceedings of the 3rd ACM workshop on Cloud computing security workshop
Controlling Queue Delay

Queue - Networks
Less is more: trading a little bandwidth for ultra-low latency in the data center

NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
vSlicer: latency-aware virtual machine scheduling via differentiated-frequency CPU slicing

Proceedings of the 21st international symposium on High-Performance Parallel and Distributed Computing
Deadline-aware datacenter tcp (D2TCP)

Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication
Finishing flows quickly with preemptive scheduling

Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication
DeTail: reducing the flow completion time tail in datacenter networks

Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication
FairCloud: sharing the network in cloud computing

Proceedings of the ACM SIGCOMM 2012 conference on Applications, technologies, architectures, and protocols for computer communication
Resource-freeing attacks: improve your cloud performance (at your neighbor's expense)

Proceedings of the 2012 ACM conference on Computer and communications security
Deconstructing datacenter packet transport

Proceedings of the 11th ACM Workshop on Hot Topics in Networks
vBalance: using interrupt load balance to improve I/O performance for SMP virtual machines

Proceedings of the Third ACM Symposium on Cloud Computing
Chronos: predictable low latency for data center applications

Proceedings of the Third ACM Symposium on Cloud Computing
Chatty tenants and the cloud network sharing problem

nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
EyeQ: practical network performance isolation at the edge

nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
Bobtail: avoiding long tails in the cloud

nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
vTurbo: accelerating virtual machine I/O processing using designated turbo-sliced core

USENIX ATC'13 Proceedings of the 2013 USENIX conference on Annual Technical Conference

Quantified Score

Hi-index	0.00

Visualization

Abstract

Public clouds have become a popular platform for building Internet-scale applications. Using virtualization, public cloud services grant customers full control of guest operating systems and applications, while service providers still retain the management of their host infrastructure. Because applications built with public clouds are often highly sensitive to response time, infrastructure builders strive to reduce the latency of their data center's internal network. However, most existing solutions require modification to the software stack controlled by guests. We introduce a new host-centric solution for improving latency in virtualized cloud environments. In this approach, we extend a classic scheduling principle---Shortest Remaining Time First---from the virtualization layer, through the host network stack, to the network switches. Experimental and simulation results show that our solution can reduce median latency of small flows by 40%, with improvements in the tail of almost 90%, while reducing throughput of large flows by less than 3%.