Provisioning multi-tier cloud applications using statistical bounds on sojourn time

Authors:
Upendra Sharma;Prashant Shenoy;Donald F. Towsley
Affiliations:
University of Massachusetts, Amherst, MA, USA;University of Massachusetts, Amherst, MA, USA;University of Massachusetts, Amherst, MA, USA
Venue:
Proceedings of the 9th international conference on Autonomic computing
Year:
2012

Abstract

In this paper we present a simple and effective approach for resource provisioning to achieve a percentile bound on the end-to-end response time of a multi-tier application. We first model the multi-tier application as an open tandem network of M/G/1-PS queues and develop a method that produces a near-optimal application configuration, i.e., the number of servers at each tier, to meet the percentile bound in a homogeneous server environment (one that uses a single type of server). We then extend our solution to a K-server case; the technique achieves good accuracy, independent of the variability of service times. Our approach has a provisioning error of no more than 3%, compared to a worst-case provisioning error of 140% for techniques based on an M/M/1-FCFS queue model. In addition, we extend our approach to handle a heterogeneous server environment, i.e., one with multiple types of servers. We find that fewer high-capacity servers are preferable for high-percentile provisioning. Finally, we extend our approach to account for the rental cost of each server type and compute a cost-efficient application configuration with savings of over 80%. We demonstrate the applicability of our approach in a real-world system by employing it to provision the two tiers of the Java implementation of TPC-W, a multi-tier transactional web benchmark that represents an e-commerce web application, namely an online bookstore.
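To make the idea of percentile-based provisioning concrete, the following is a minimal sketch of the kind of M/M/1-FCFS calculation the abstract uses as its point of comparison. It is not the authors' M/G/1-PS method, which the abstract does not spell out. It relies on the standard result that the sojourn time in a stable M/M/1-FCFS queue is exponentially distributed with rate mu - lambda. The function names, the single-tier setting, and the assumption that Poisson arrivals are split evenly at random across identical servers are illustrative choices, not taken from the paper.

```python
import math


def mm1_sojourn_quantile(lam, mu, p):
    """p-quantile of the sojourn time in an M/M/1-FCFS queue.

    lam: arrival rate, mu: service rate (same time unit), 0 < p < 1.
    For a stable queue the sojourn time is exponential with rate
    (mu - lam); an unstable queue (lam >= mu) yields infinity.
    """
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    if lam >= mu:
        return math.inf
    return -math.log(1.0 - p) / (mu - lam)


def min_servers(total_lam, mu, p, bound):
    """Smallest number of identical servers for one tier (hypothetical helper).

    Assumes the tier's Poisson arrivals are split evenly at random,
    so each server is an independent M/M/1 queue with rate total_lam / n.
    """
    # Even an idle server must meet the bound, otherwise no n suffices.
    if mm1_sojourn_quantile(0.0, mu, p) > bound:
        raise ValueError("bound is unreachable with this server type")
    n = 1
    while mm1_sojourn_quantile(total_lam / n, mu, p) > bound:
        n += 1
    return n


if __name__ == "__main__":
    # 100 requests/s, 20 requests/s per server, 95th percentile <= 0.5 s.
    print(min_servers(total_lam=100.0, mu=20.0, p=0.95, bound=0.5))
```

With these illustrative numbers the search stops at 8 servers. A model of this kind assumes exponential service times, which is one reason the abstract reports large provisioning errors for M/M/1-FCFS-based techniques when service times are more variable.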