Power-aware resource allocation for CPU-and memory-intense internet services

Authors:
Vlasia Anagnostopoulou;Susmit Biswas;Heba Saadeldeen;Ricardo Bianchini;Tao Yang;Diana Franklin;Frederic T. Chong
Affiliations:
Department of Computer Science, University of California, Santa Barbara;Department of Computer Science, University of California, Santa Barbara;Department of Computer Science, University of California, Santa Barbara;Department of Computer Science, Rutgers University;Department of Computer Science, University of California, Santa Barbara;Department of Computer Science, University of California, Santa Barbara;Department of Computer Science, University of California, Santa Barbara
Venue:
E2DC'12 Proceedings of the First international conference on Energy Efficient Data Centers
Year:
2012

Citing 15
Cited 0

Locality-aware request distribution in cluster-based network servers

Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Dynamic tracking of page miss ratio curve for memory management

ASPLOS XI Proceedings of the 11th international conference on Architectural support for programming languages and operating systems
PRESS: A Clustered Server Based on User-Level Communication

IEEE Transactions on Parallel and Distributed Systems
Managing server energy and operational costs in hosting centers

SIGMETRICS '05 Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Energy conservation in heterogeneous server clusters

Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming
On evaluating request-distribution schemes for saving energy in server clusters

ISPASS '03 Proceedings of the 2003 IEEE International Symposium on Performance Analysis of Systems and Software
Utility-Based Cache Partitioning: A Low-Overhead, High-Performance, Runtime Mechanism to Partition Shared Caches

Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture
The Case for Energy-Proportional Computing

Computer
No "power" struggles: coordinated multi-level power management for the data center

Proceedings of the 13th international conference on Architectural support for programming languages and operating systems
PowerNap: eliminating server idle power

Proceedings of the 14th international conference on Architectural support for programming languages and operating systems
Somniloquy: augmenting network interfaces to reduce PC energy usage

NSDI'09 Proceedings of the 6th USENIX symposium on Networked systems design and implementation
Evaluation techniques for storage hierarchies

IBM Systems Journal
Energy-efficient server clusters

PACS'02 Proceedings of the 2nd international conference on Power-aware computer systems
Server Engineering Insights for Large-Scale Online Services

IEEE Micro
Power management of online data-intensive services

Proceedings of the 38th annual international symposium on Computer architecture

Quantified Score

Hi-index	0.00

Visualization

Abstract

Internet service providers face the daunting task of maintaining guaranteed latency requirements while reducing power requirements. In this work, we focus on a class of services with very high cpu and memory demands, best represented by internet search. These services provide strict latency guarantees defined in Service-Level Agreements, yet the clusters need to be flexible to different optimizations, i.e. to minimize power consumption or to maximize resource usage. Unfortunately, standard cluster algorithms, such as resource allocation, are oblivious of the SLA allocations, while power management is typically only driven by cpu demand. We propose a power-aware resource allocation algorithm for the cpu and the memory which is driven by SLA and allows for various dynamic cluster configurations, from energy-optimal to resource-usage-optimal. Using trace-based simulation of two service models, we show that up to 24% energy can be preserved compared to the state-of-art scheme, or maximum memory utility can be achieved with 20% savings.