More is less: reducing latency via redundancy

  • Authors:
  • Ashish Vulimiri;Oliver Michel;P. Brighten Godfrey;Scott Shenker

  • Affiliations:
  • UIUC;University of Vienna;UIUC;UC Berkeley and ICSI

  • Venue:
  • Proceedings of the 11th ACM Workshop on Hot Topics in Networks
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Low latency is critical for interactive networked applications. But while we know how to scale systems to increase capacity, reducing latency --- especially the tail of the latency distribution --- can be much more difficult. We argue that the use of redundancy in the context of the wide-area Internet is an effective way to convert a small amount of extra capacity into reduced latency. By initiating redundant operations across diverse resources and using the first result which completes, redundancy improves a system's latency even under exceptional conditions. We demonstrate that redundancy can significantly reduce latency for small but critical tasks, and argue that it is an effective general-purpose strategy even on devices like cell phones where bandwidth is relatively constrained.