Efficient Autoscaling in the Cloud Using Predictive Models for Workload Forecasting

Authors:
Nilabja Roy;Abhishek Dubey;Aniruddha Gokhale
Affiliations:
-;-;-
Venue:
CLOUD '11 Proceedings of the 2011 IEEE 4th International Conference on Cloud Computing
Year:
2011

Citing 0
Cited 5

Is your cloud elastic enough?: performance modelling the elasticity of infrastructure as a service (IaaS) cloud applications

ICPE '12 Proceedings of the 3rd ACM/SPEC International Conference on Performance Engineering
A Survey on Cloud Computing Elasticity

UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing
Rebalancing in a multi-cloud environment

Proceedings of the 4th ACM workshop on Scientific cloud computing
A framework for dynamically generating predictive models of workflow execution

WORKS '13 Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science
A cost-aware auto-scaling approach using the workload prediction in service clouds

Information Systems Frontiers

Quantified Score

Hi-index	0.00

Visualization

Abstract

Large-scale component-based enterprise applications that leverage Cloud resources expect Quality of Service(QoS) guarantees in accordance with service level agreements between the customer and service providers. In the context of Cloud computing, auto scaling mechanisms hold the promise of assuring QoS properties to the applications while simultaneously making efficient use of resources and keeping operational costs low for the service providers. Despite the perceived advantages of auto scaling, realizing the full potential of auto scaling is hard due to multiple challenges stemming from the need to precisely estimate resource usage in the face of significant variability in client workload patterns. This paper makes three contributions to overcome the general lack of effective techniques for workload forecasting and optimal resource allocation. First, it discusses the challenges involved in auto scaling in the cloud. Second, it develops a model-predictive algorithm for workload forecasting that is used for resource auto scaling. Finally, empirical results are provided that demonstrate that resources can be allocated and deal located by our algorithm in a way that satisfies both the application QoS while keeping operational costs low.