A cost-aware auto-scaling approach using the workload prediction in service clouds

  • Authors:
  • Jingqi Yang;Chuanchang Liu;Yanlei Shang;Bo Cheng;Zexiang Mao;Chunhong Liu;Lisha Niu;Junliang Chen

  • Affiliations:
  • State Key Lab of Networking and Switching Technology, Beijing University of Posts & Telecommunications, Beijing, China 100876;State Key Lab of Networking and Switching Technology, Beijing University of Posts & Telecommunications, Beijing, China 100876;State Key Lab of Networking and Switching Technology, Beijing University of Posts & Telecommunications, Beijing, China 100876;State Key Lab of Networking and Switching Technology, Beijing University of Posts & Telecommunications, Beijing, China 100876;State Key Lab of Networking and Switching Technology, Beijing University of Posts & Telecommunications, Beijing, China 100876;State Key Lab of Networking and Switching Technology, Beijing University of Posts & Telecommunications, Beijing, China 100876;State Key Lab of Networking and Switching Technology, Beijing University of Posts & Telecommunications, Beijing, China 100876;State Key Lab of Networking and Switching Technology, Beijing University of Posts & Telecommunications, Beijing, China 100876

  • Venue:
  • Information Systems Frontiers
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

Service clouds are distributed infrastructures which deploys communication services in clouds. The scalability is an important characteristic of service clouds. With the scalability, the service cloud can offer on-demand computing power and storage capacities to different services. In order to achieve the scalability, we need to know when and how to scale virtual resources assigned to different services. In this paper, a novel service cloud architecture is presented, and a linear regression model is used to predict the workload. Based on this predicted workload, an auto-scaling mechanism is proposed to scale virtual resources at different resource levels in service clouds. The auto-scaling mechanism combines the real-time scaling and the pre-scaling. Finally experimental results are provided to demonstrate that our approach can satisfy the user Service Level Agreement (SLA) while keeping scaling costs low.