A methodology for workload characterization of E-commerce sites
Proceedings of the 1st ACM conference on Electronic commerce
In search of invariants for e-business workloads
Proceedings of the 2nd ACM conference on Electronic commerce
Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach
IEEE Transactions on Parallel and Distributed Systems
An Architectural Evaluation of Java TPC-W
HPCA '01 Proceedings of the 7th International Symposium on High-Performance Computer Architecture
Xen and the art of virtualization
SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Dynamic Provisioning of Multi-tier Internet Applications
ICAC '05 Proceedings of the Second International Conference on Automatic Computing
Resource Allocation for Autonomic Data Centers using Analytic Performance Models
ICAC '05 Proceedings of the Second International Conference on Automatic Computing
Performance modeling and system management for multi-component online services
NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
Sharing networked resources with brokered leases
ATEC '06 Proceedings of the annual conference on USENIX '06 Annual Technical Conference
Exploiting nonstationarity for performance prediction
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
RTSS '07 Proceedings of the 28th IEEE International Real-Time Systems Symposium
Automatic request categorization in internet services
ACM SIGMETRICS Performance Evaluation Review
Autonomic Provisioning of Backend Databases in Dynamic Content Web Servers
ICAC '06 Proceedings of the 2006 IEEE International Conference on Autonomic Computing
Workload Analysis and Demand Prediction of Enterprise Data Center Applications
IISWC '07 Proceedings of the 2007 IEEE 10th International Symposium on Workload Characterization
vManage: loosely coupled platform and virtualization management in data centers
ICAC '09 Proceedings of the 6th international conference on Autonomic computing
Empirical evaluation of latency-sensitive application performance in the cloud
MMSys '10 Proceedings of the first annual ACM SIGMM conference on Multimedia systems
Adaptive resource provisioning for read intensive multi-tier applications in the cloud
Future Generation Computer Systems
PERFUME: power and performance guarantee with fuzzy MIMO control in virtualized servers
Proceedings of the Nineteenth International Workshop on Quality of Service
Enacting SLAs in clouds using rules
Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
Self-management challenges for multi-cloud architectures
ServiceWave'11 Proceedings of the 4th European conference on Towards a service-based internet
SLA-based resource provisioning for heterogeneous workloads in a virtualized cloud datacenter
ICA3PP'11 Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part I
Regression-based resource provisioning for session slowdown guarantee in multi-tier Internet servers
Journal of Parallel and Distributed Computing
Dynamic Replication in Service-Oriented Systems
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Application-aware cross-layer virtual machine resource management
Proceedings of the 9th international conference on Autonomic computing
AROMA: automated resource allocation and configuration of mapreduce environment in the cloud
Proceedings of the 9th international conference on Autonomic computing
Transactional auto scaler: elastic scaling of in-memory transactional data grids
Proceedings of the 9th international conference on Autonomic computing
Usage patterns in multi-tenant data centers: a temporal perspective
Proceedings of the 9th international conference on Autonomic computing
On load balancing: a mix-aware algorithm for heterogeneous systems
Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering
Autonomic Provisioning with Self-Adaptive Neural Fuzzy Control for Percentile-Based Delay Guarantee
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Hi-index | 0.00 |
Online Internet applications see dynamic workloads that fluctuate over multiple time scales. This paper argues that the non-stationarity in Internet application workloads, which causes the request mix to change over time, can have a significant impact on the overall processing demands imposed on data center servers. We propose a novel mix-aware dynamic provisioning technique that handles both the non-stationarity in the workload as well as changes in request volumes when allocating server capacity in Internet data centers. Our technique employs the k-means clustering algorithm to automatically determine the workload mix and a queuing model to predict the server capacity for a given workload mix. We implement a prototype provisioning system that incorporates our technique and experimentally evaluate its efficacy on a laboratory Linux data center running the TPC-W web benchmark. Our results show that our k-means clustering technique accurately captures workload mix changes in Internet applications. We also demonstrate that mix-aware dynamic provisioning eliminates SLA violations due to under-provisioning with non-stationary web workloads, and that it offers a better resource usage by reducing over-provisioning when compared to a baseline provisioning approach that only reacts to workload volume changes. We also present a case study of our provisioning approach on Amazon's EC2 cloud platform.