An analytical model for multi-tier internet services and its applications
SIGMETRICS '05 Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Virtual Clusters for Grid Communities
CCGRID '06 Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid
Resource management for clusters of virtual machines
CCGRID '05 Proceedings of the Fifth IEEE International Symposium on Cluster Computing and the Grid - Volume 01
Adaptive control of virtualized resources in utility computing environments
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Agile dynamic provisioning of multi-tier Internet applications
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
The Art of Capacity Planning: Scaling Web Resources
The Art of Capacity Planning: Scaling Web Resources
Hardness of Approximation and Greedy Algorithms for the Adaptation Problem in Virtual Environments
ICAC '06 Proceedings of the 2006 IEEE International Conference on Autonomic Computing
Future Generation Computer Systems
SLA-Driven Adaptive Resource Management for Web Applications on a Heterogeneous Compute Cloud
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
Web Server Farm in the Cloud: Performance Evaluation and Dynamic Architecture
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
Autonomic mix-aware provisioning for non-stationary data center workloads
Proceedings of the 7th international conference on Autonomic computing
From infrastructure delivery to service management in clouds
Future Generation Computer Systems
Suspending, migrating and resuming HPC virtual clusters
Future Generation Computer Systems
Statistical machine learning makes automatic control practical for internet datacenters
HotCloud'09 Proceedings of the 2009 conference on Hot topics in cloud computing
GPC'10 Proceedings of the 5th international conference on Advances in Grid and Pervasive Computing
Self-management challenges for multi-cloud architectures
ServiceWave'11 Proceedings of the 4th European conference on Towards a service-based internet
Feedback-based optimization of a private cloud
Future Generation Computer Systems
Lightweight Resource Scaling for Cloud Applications
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Unsupervised Neural Predictor to Auto-administrate the Cloud Infrastructure
UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
Transforming reactive auto-scaling into proactive auto-scaling
Proceedings of the 3rd International Workshop on Cloud Data and Platforms
SCAling: SLA-driven cloud auto-scaling
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Survey Cloud monitoring: A survey
Computer Networks: The International Journal of Computer and Telecommunications Networking
Application-Centric resource provisioning for amazon EC2 spot instances
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
An energy-efficient self-provisioning approach for cloud resources management
ACM SIGOPS Operating Systems Review
Enabling cost-aware and adaptive elasticity of multi-tier cloud applications
Future Generation Computer Systems
Hi-index | 0.00 |
A Service-Level Agreement (SLA) provides surety for specific quality attributes to the consumers of services. However, current SLAs offered by cloud infrastructure providers do not address response time, which, from the user's point of view, is the most important quality attribute for Web applications. Satisfying a maximum average response time guarantee for Web applications is difficult for two main reasons: first, traffic patterns are highly dynamic and difficult to predict accurately; second, the complex nature of multi-tier Web applications increases the difficulty of identifying bottlenecks and resolving them automatically. This paper proposes a methodology and presents a working prototype system for automatic detection and resolution of bottlenecks in a multi-tier Web application hosted on a cloud in order to satisfy specific maximum response time requirements. It also proposes a method for identifying and retracting over-provisioned resources in multi-tier cloud-hosted Web applications. We demonstrate the feasibility of the approach in an experimental evaluation with a testbed EUCALYPTUS-based cloud and a synthetic workload. Automatic bottleneck detection and resolution under dynamic resource management has the potential to enable cloud infrastructure providers to provide SLAs for Web applications that guarantee specific response time requirements while minimizing resource utilization.