Adaptive resource provisioning for read intensive multi-tier applications in the cloud

Authors:
Waheed Iqbal;Matthew N. Dailey;David Carrera;Paul Janecek
Affiliations:
Computer Science and Information Management, Asian Institute of Technology, Thailand;Computer Science and Information Management, Asian Institute of Technology, Thailand;Technical University of Catalonia (UPC), Barcelona Supercomputing Center (BSC), Spain;Computer Science and Information Management, Asian Institute of Technology, Thailand
Venue:
Future Generation Computer Systems
Year:
2011

Citing 15
Cited 10

An analytical model for multi-tier internet services and its applications

SIGMETRICS '05 Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Virtual Clusters for Grid Communities

CCGRID '06 Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid
Resource management for clusters of virtual machines

CCGRID '05 Proceedings of the Fifth IEEE International Symposium on Cluster Computing and the Grid - Volume 01
Adaptive control of virtualized resources in utility computing environments

Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Agile dynamic provisioning of multi-tier Internet applications

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
The Art of Capacity Planning: Scaling Web Resources

The Art of Capacity Planning: Scaling Web Resources
Hardness of Approximation and Greedy Algorithms for the Adaptation Problem in Virtual Environments

ICAC '06 Proceedings of the 2006 IEEE International Conference on Autonomic Computing
Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility

Future Generation Computer Systems
SLA-Driven Adaptive Resource Management for Web Applications on a Heterogeneous Compute Cloud

CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
Web Server Farm in the Cloud: Performance Evaluation and Dynamic Architecture

CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
Autonomic mix-aware provisioning for non-stationary data center workloads

Proceedings of the 7th international conference on Autonomic computing
From infrastructure delivery to service management in clouds

Future Generation Computer Systems
Suspending, migrating and resuming HPC virtual clusters

Future Generation Computer Systems
Statistical machine learning makes automatic control practical for internet datacenters

HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
SLA-Driven automatic bottleneck detection and resolution for read intensive multi-tier applications hosted on a cloud

GPC'10 Proceedings of the 5th international conference on Advances in Grid and Pervasive Computing

Self-management challenges for multi-cloud architectures

ServiceWave'11 Proceedings of the 4th European conference on Towards a service-based internet
Feedback-based optimization of a private cloud

Future Generation Computer Systems
Lightweight Resource Scaling for Cloud Applications

CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Unsupervised Neural Predictor to Auto-administrate the Cloud Infrastructure

UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
Transforming reactive auto-scaling into proactive auto-scaling

Proceedings of the 3rd International Workshop on Cloud Data and Platforms
SCAling: SLA-driven cloud auto-scaling

Proceedings of the 28th Annual ACM Symposium on Applied Computing
Survey Cloud monitoring: A survey

Computer Networks: The International Journal of Computer and Telecommunications Networking
Application-Centric resource provisioning for amazon EC2 spot instances

Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
An energy-efficient self-provisioning approach for cloud resources management

ACM SIGOPS Operating Systems Review
Enabling cost-aware and adaptive elasticity of multi-tier cloud applications

Future Generation Computer Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

A Service-Level Agreement (SLA) provides surety for specific quality attributes to the consumers of services. However, current SLAs offered by cloud infrastructure providers do not address response time, which, from the user's point of view, is the most important quality attribute for Web applications. Satisfying a maximum average response time guarantee for Web applications is difficult for two main reasons: first, traffic patterns are highly dynamic and difficult to predict accurately; second, the complex nature of multi-tier Web applications increases the difficulty of identifying bottlenecks and resolving them automatically. This paper proposes a methodology and presents a working prototype system for automatic detection and resolution of bottlenecks in a multi-tier Web application hosted on a cloud in order to satisfy specific maximum response time requirements. It also proposes a method for identifying and retracting over-provisioned resources in multi-tier cloud-hosted Web applications. We demonstrate the feasibility of the approach in an experimental evaluation with a testbed EUCALYPTUS-based cloud and a synthetic workload. Automatic bottleneck detection and resolution under dynamic resource management has the potential to enable cloud infrastructure providers to provide SLAs for Web applications that guarantee specific response time requirements while minimizing resource utilization.