The simulation and evaluation of dynamic voltage scaling algorithms
ISLPED '98 Proceedings of the 1998 international symposium on Low power electronics and design
httperf—a tool for measuring web server performance
ACM SIGMETRICS Performance Evaluation Review
Quantitative comparison of power management algorithms
DATE '00 Proceedings of the conference on Design, automation and test in Europe
Managing energy and server resources in hosting centers
SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
iBOM: A Platform for Intelligent Business Operation Management
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
An analytical model for multi-tier internet services and its applications
SIGMETRICS '05 Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Managing server energy and operational costs in hosting centers
SIGMETRICS '05 Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Dynamic Provisioning of Multi-tier Internet Applications
ICAC '05 Proceedings of the Second International Conference on Automatic Computing
Feedback Control Architecture and Design Methodology for Service Delay Guarantees in Web Servers
IEEE Transactions on Parallel and Distributed Systems
Power provisioning for a warehouse-sized computer
Proceedings of the 34th annual international symposium on Computer architecture
Policies for dynamic clock scheduling
OSDI'00 Proceedings of the 4th conference on Symposium on Operating System Design & Implementation - Volume 4
Dynamo: amazon's highly available key-value store
Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
Adaptive quality of service management for enterprise services
ACM Transactions on the Web (TWEB)
Energy-aware server provisioning and load dispatching for connection-intensive internet services
NSDI'08 Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation
Multi-mode energy management for multi-tier server clusters
Proceedings of the 17th international conference on Parallel architectures and compilation techniques
PowerNap: eliminating server idle power
Proceedings of the 14th international conference on Architectural support for programming languages and operating systems
Q-clouds: managing performance interference effects for QoS-aware clouds
Proceedings of the 5th European conference on Computer systems
Energy-efficient server clusters
PACS'02 Proceedings of the 2nd international conference on Power-aware computer systems
Stochastic approximation control of power and tardiness in a three-tier web-hosting cluster
Proceedings of the 7th international conference on Autonomic computing
NapSAC: design and implementation of a power-proportional web cluster
Proceedings of the first ACM SIGCOMM workshop on Green networking
Statistical machine learning makes automatic control practical for internet datacenters
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
Server workload analysis for power minimization using consolidation
USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
Dynamic knobs for responsive power-aware computing
Proceedings of the sixteenth international conference on Architectural support for programming languages and operating systems
Black-box and gray-box strategies for virtual machine migration
NSDI'07 Proceedings of the 4th USENIX conference on Networked systems design & implementation
Power management of online data-intensive services
Proceedings of the 38th annual international symposium on Computer architecture
A dynamic energy management scheme for multi-tier data centers
ISPASS '11 Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software
The case for sleep states in servers
HotPower '11 Proceedings of the 4th Workshop on Power-Aware Computing and Systems
Minimizing data center SLA violations and power consumption via hybrid resource provisioning
IGCC '11 Proceedings of the 2011 International Green Computing Conference and Workshops
A control-based middleware framework for quality-of-service adaptations
IEEE Journal on Selected Areas in Communications
Agile, efficient virtualization power management with low-latency server power states
Proceedings of the 40th Annual International Symposium on Computer Architecture
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
ACM SIGMETRICS Performance Evaluation Review - Special issue on the 31st international symposium on computer performance, modeling, measurements and evaluation (IFIPWG 7.3 Performance 2013)
Elastic resource management for heterogeneous applications on PaaS
Proceedings of the 5th Asia-Pacific Symposium on Internetware
Hi-index | 0.00 |
Energy costs for data centers continue to rise, already exceeding $15 billion yearly. Sadly much of this power is wasted. Servers are only busy 10--30% of the time on average, but they are often left on, while idle, utilizing 60% or more of peak power when in the idle state. We introduce a dynamic capacity management policy, AutoScale, that greatly reduces the number of servers needed in data centers driven by unpredictable, time-varying load, while meeting response time SLAs. AutoScale scales the data center capacity, adding or removing servers as needed. AutoScale has two key features: (i) it autonomically maintains just the right amount of spare capacity to handle bursts in the request rate; and (ii) it is robust not just to changes in the request rate of real-world traces, but also request size and server efficiency. We evaluate our dynamic capacity management approach via implementation on a 38-server multi-tier data center, serving a web site of the type seen in Facebook or Amazon, with a key-value store workload. We demonstrate that AutoScale vastly improves upon existing dynamic capacity management policies with respect to meeting SLAs and robustness.