Locality-aware request distribution in cluster-based network servers
Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Dynamic tracking of page miss ratio curve for memory management
ASPLOS XI Proceedings of the 11th international conference on Architectural support for programming languages and operating systems
PRESS: A Clustered Server Based on User-Level Communication
IEEE Transactions on Parallel and Distributed Systems
Managing server energy and operational costs in hosting centers
SIGMETRICS '05 Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Energy conservation in heterogeneous server clusters
Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming
On evaluating request-distribution schemes for saving energy in server clusters
ISPASS '03 Proceedings of the 2003 IEEE International Symposium on Performance Analysis of Systems and Software
Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture
No "power" struggles: coordinated multi-level power management for the data center
Proceedings of the 13th international conference on Architectural support for programming languages and operating systems
PowerNap: eliminating server idle power
Proceedings of the 14th international conference on Architectural support for programming languages and operating systems
Somniloquy: augmenting network interfaces to reduce PC energy usage
NSDI'09 Proceedings of the 6th USENIX symposium on Networked systems design and implementation
Evaluation techniques for storage hierarchies
IBM Systems Journal
Energy-efficient server clusters
PACS'02 Proceedings of the 2nd international conference on Power-aware computer systems
Power management of online data-intensive services
Proceedings of the 38th annual international symposium on Computer architecture
Hi-index | 0.00 |
Internet service providers face the daunting task of maintaining guaranteed latency requirements while reducing power requirements. In this work, we focus on a class of services with very high cpu and memory demands, best represented by internet search. These services provide strict latency guarantees defined in Service-Level Agreements, yet the clusters need to be flexible to different optimizations, i.e. to minimize power consumption or to maximize resource usage. Unfortunately, standard cluster algorithms, such as resource allocation, are oblivious of the SLA allocations, while power management is typically only driven by cpu demand. We propose a power-aware resource allocation algorithm for the cpu and the memory which is driven by SLA and allows for various dynamic cluster configurations, from energy-optimal to resource-usage-optimal. Using trace-based simulation of two service models, we show that up to 24% energy can be preserved compared to the state-of-art scheme, or maximum memory utility can be achieved with 20% savings.