Autonomic mix-aware provisioning for non-stationary data center workloads

Authors:
Rahul Singh;Upendra Sharma;Emmanuel Cecchet;Prashant Shenoy
Affiliations:
University of Massachusetts, Amherst, MA, USA;University of Massachusetts, Amherst, MA, USA;University of Massachusetts, Amherst, MA, USA;University of Massachusetts, Amherst, MA, USA
Venue:
Proceedings of the 7th international conference on Autonomic computing
Year:
2010

Citing 18
Cited 13

A methodology for workload characterization of E-commerce sites

Proceedings of the 1st ACM conference on Electronic commerce
In search of invariants for e-business workloads

Proceedings of the 2nd ACM conference on Electronic commerce
Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach

IEEE Transactions on Parallel and Distributed Systems
An Architectural Evaluation of Java TPC-W

HPCA '01 Proceedings of the 7th International Symposium on High-Performance Computer Architecture
Xen and the art of virtualization

SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Virtual Distributed Environments in a Shared Infrastructure

Computer
Dynamic Provisioning of Multi-tier Internet Applications

ICAC '05 Proceedings of the Second International Conference on Automatic Computing
Resource Allocation for Autonomic Data Centers using Analytic Performance Models

ICAC '05 Proceedings of the Second International Conference on Automatic Computing
Performance modeling and system management for multi-component online services

NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
Sharing networked resources with brokered leases

ATEC '06 Proceedings of the annual conference on USENIX '06 Annual Technical Conference
Exploiting nonstationarity for performance prediction

Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Integrating Adaptive Components: An Emerging Challenge in Performance-Adaptive Systems and a Server Farm Case-Study

RTSS '07 Proceedings of the 28th IEEE International Real-Time Systems Symposium
A regression-based analytic model for capacity planning of multi-tier applications

Cluster Computing
Automatic request categorization in internet services

ACM SIGMETRICS Performance Evaluation Review
Autonomic Provisioning of Backend Databases in Dynamic Content Web Servers

ICAC '06 Proceedings of the 2006 IEEE International Conference on Autonomic Computing
Workload Analysis and Demand Prediction of Enterprise Data Center Applications

IISWC '07 Proceedings of the 2007 IEEE 10th International Symposium on Workload Characterization
vManage: loosely coupled platform and virtualization management in data centers

ICAC '09 Proceedings of the 6th international conference on Autonomic computing
Empirical evaluation of latency-sensitive application performance in the cloud

MMSys '10 Proceedings of the first annual ACM SIGMM conference on Multimedia systems

Adaptive resource provisioning for read intensive multi-tier applications in the cloud

Future Generation Computer Systems
PERFUME: power and performance guarantee with fuzzy MIMO control in virtualized servers

Proceedings of the Nineteenth International Workshop on Quality of Service
Enacting SLAs in clouds using rules

Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
Self-management challenges for multi-cloud architectures

ServiceWave'11 Proceedings of the 4th European conference on Towards a service-based internet
SLA-based resource provisioning for heterogeneous workloads in a virtualized cloud datacenter

ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I
Regression-based resource provisioning for session slowdown guarantee in multi-tier Internet servers

Journal of Parallel and Distributed Computing
Dynamic Replication in Service-Oriented Systems

CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Application-aware cross-layer virtual machine resource management

Proceedings of the 9th international conference on Autonomic computing
AROMA: automated resource allocation and configuration of mapreduce environment in the cloud

Proceedings of the 9th international conference on Autonomic computing
Transactional auto scaler: elastic scaling of in-memory transactional data grids

Proceedings of the 9th international conference on Autonomic computing
Usage patterns in multi-tenant data centers: a temporal perspective

Proceedings of the 9th international conference on Autonomic computing
On load balancing: a mix-aware algorithm for heterogeneous systems

Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering
Autonomic Provisioning with Self-Adaptive Neural Fuzzy Control for Percentile-Based Delay Guarantee

ACM Transactions on Autonomous and Adaptive Systems (TAAS)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Online Internet applications see dynamic workloads that fluctuate over multiple time scales. This paper argues that the non-stationarity in Internet application workloads, which causes the request mix to change over time, can have a significant impact on the overall processing demands imposed on data center servers. We propose a novel mix-aware dynamic provisioning technique that handles both the non-stationarity in the workload as well as changes in request volumes when allocating server capacity in Internet data centers. Our technique employs the k-means clustering algorithm to automatically determine the workload mix and a queuing model to predict the server capacity for a given workload mix. We implement a prototype provisioning system that incorporates our technique and experimentally evaluate its efficacy on a laboratory Linux data center running the TPC-W web benchmark. Our results show that our k-means clustering technique accurately captures workload mix changes in Internet applications. We also demonstrate that mix-aware dynamic provisioning eliminates SLA violations due to under-provisioning with non-stationary web workloads, and that it offers a better resource usage by reducing over-provisioning when compared to a baseline provisioning approach that only reacts to workload volume changes. We also present a case study of our provisioning approach on Amazon's EC2 cloud platform.