Memory-manager/scheduler co-design: optimizing event-driven servers to improve cache behavior

Authors:
Sapan Bhatia;Charles Consel;Julia Lawall
Affiliations:
INRIA/LaBRI;INRIA/LaBRI;University of Copenhagen
Venue:
Proceedings of the 5th international symposium on Memory management
Year:
2006

Citing 12
Cited 2

DPF: fast, flexible message demultiplexing using dynamic code generation

Conference proceedings on Applications, technologies, architectures, and protocols for computer communications
Speeding up protocols for small messages

Conference proceedings on Applications, technologies, architectures, and protocols for computer communications
Masking the overhead of protocol layering

Conference proceedings on Applications, technologies, architectures, and protocols for computer communications
Cache-conscious data placement

Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Cache-conscious structure layout

Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation
Cache-conscious structure definition

Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation
SEDA: an architecture for well-conditioned, scalable internet services

SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
Profile-directed optimization of event-based programs

PLDI '02 Proceedings of the ACM SIGPLAN 2002 Conference on Programming language design and implementation
Using Cohort-Scheduling to Enhance Server Performance

ATEC '02 Proceedings of the General Track of the annual conference on USENIX Annual Technical Conference
Capriccio: scalable threads for internet services

SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Lazy asynchronous I/O for event-driven servers

ATEC '04 Proceedings of the annual conference on USENIX Annual Technical Conference
Flash: an efficient and portable web server

ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference

Issues in holistic system design

Proceedings of the 3rd workshop on Programming languages and operating systems: linguistic support for modern operating systems
Minimizing accumulative memory load cost on multi-core DSPs with multi-level memory

Journal of Systems Architecture: the EUROMICRO Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Event-driven programming has emerged as a standard to implement high-performance servers due to its flexibility and low OS overhead. Still, memory access remains a bottleneck. Generic optimization techniques yield only small improvements in the memory access behavior of event-driven servers, as such techniques do not exploit their specific structure and behavior.This paper presents an optimization framework dedicated to event-driven servers, based on a strategy to eliminate data-cache misses. We propose a novel memory manager combined with a tailored scheduling strategy to restrict the working data set of the program to a memory region mapped directly into the data cache. Our approach exploits the flexible scheduling and deterministic execution of event-driven servers.We have applied our framework to industry-standard webservers including TUX and thttpd, as well as to the Squid proxy server and the Cactus QoS framework. Testing TUX and thttpd using a standard HTTP benchmark tool shows that our optimizations applied to the TUX web server reduce L2 data cache misses under heavy load by up to 75% and increase the throughput of the server by up to 38%.