Understanding and addressing blocking-induced network server latency

  • Authors:
  • Yaoping Ruan; Vivek Pai

  • Affiliations:
  • IBM T.J. Watson Research Center; Princeton University

  • Venue:
  • ATEC '06: Proceedings of the 2006 USENIX Annual Technical Conference
  • Year:
  • 2006

Abstract

We investigate the origin and components of network server latency under various loads and find that filesystem-related kernel queues exhibit head-of-line blocking, which leads to bursty behavior in event delivery and process scheduling. In turn, these problems degrade the existing fairness and scheduling policies of the operating system, causing requests that could have been served from memory, with low latency, to wait unnecessarily on disk-bound requests. While this batching behavior only mildly affects throughput, it severely degrades latency. The problem manifests as degraded fairness and service quality, a phenomenon we call service inversion. We present a portable solution that avoids these problems without kernel or filesystem modifications. We modify two different Web servers to use this approach and demonstrate a qualitative change in their latency profiles, yielding more than an order-of-magnitude reduction in latency. The resulting systems are able to serve most requests without being tied to disk performance, and they scale better with improvements in processor speed. These results do not depend on server software architecture and can be profitably applied to both experimental and production servers.
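
To make the blocking issue concrete, the sketch below shows one illustrative way a server's dispatch path could keep cached requests from queuing behind disk-bound ones: it probes page-cache residency with mincore() and hands non-resident files to a separate, blocking-tolerant path. This is only a minimal sketch of the kind of user-level separation the abstract describes, not the paper's implementation; the hook functions serve_from_cache() and enqueue_for_disk_helper() are hypothetical placeholders for the server's own event loop and helper mechanism.

/*
 * Illustrative sketch (Linux-style mincore()): requests whose file data is
 * already resident in the page cache take a fast, non-blocking path; all
 * others are handed to a path that is allowed to block on disk.
 */
#include <fcntl.h>
#include <stdlib.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical hooks into the server's event loop. */
void serve_from_cache(int fd, off_t len);        /* fast path: data resident      */
void enqueue_for_disk_helper(int fd, off_t len); /* slow path: may block on disk  */

/* Returns 1 if every page of the file is already in the page cache. */
static int file_is_resident(int fd, off_t len)
{
    long pagesz = sysconf(_SC_PAGESIZE);
    size_t npages = (len + pagesz - 1) / pagesz;
    void *map = mmap(NULL, len, PROT_READ, MAP_SHARED, fd, 0);
    if (map == MAP_FAILED)
        return 0;                       /* treat unknown as non-resident */

    unsigned char *vec = malloc(npages);
    int resident = 0;
    if (vec && mincore(map, len, vec) == 0) {
        resident = 1;
        for (size_t i = 0; i < npages; i++)
            if (!(vec[i] & 1)) { resident = 0; break; }
    }
    free(vec);
    munmap(map, len);
    return resident;
}

/* Dispatch: cached requests bypass the queue of disk-bound requests. */
void dispatch_request(const char *path)
{
    int fd = open(path, O_RDONLY);
    if (fd < 0)
        return;
    struct stat st;
    if (fstat(fd, &st) < 0) { close(fd); return; }

    if (st.st_size > 0 && file_is_resident(fd, st.st_size))
        serve_from_cache(fd, st.st_size);
    else
        enqueue_for_disk_helper(fd, st.st_size);
}

A dispatcher along these lines lets in-memory requests complete without ever sharing a queue with requests that must touch the disk, which is the essence of avoiding the service inversion the abstract identifies.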