Dual time-scale distributed capacity allocation and load redirect algorithms for cloud systems

Authors:
Danilo Ardagna;Sara Casolari;Michele Colajanni;Barbara Panicucci
Affiliations:
Politecnico di Milano, Dipartimento di Elettronica Informazione, Italy;Universití di Modena e Reggio Emilia, Dipartimento di Ingegneria dell'Informazione, Italy;Universití di Modena e Reggio Emilia, Dipartimento di Ingegneria dell'Informazione, Italy;Politecnico di Milano, Dipartimento di Elettronica Informazione, Italy and Universití di Modena e Reggio Emilia, Dipartimento di Scienze e Metodi dell'Ingegneria, Italy
Venue:
Journal of Parallel and Distributed Computing
Year:
2012

Citing 27
Cited 1

Queueing networks and Markov chains: modeling and performance evaluation with computer science applications

Queueing networks and Markov chains: modeling and performance evaluation with computer science applications
Heavy-tailed probability distributions in the World Wide Web

A practical guide to heavy tails
Distributed cooperative Apache web server

Proceedings of the 10th international conference on World Wide Web
NCSA's World Wide Web Server: Design and Performance

Computer
Request Redirection Algorithms for Distributed Web Systems

IEEE Transactions on Parallel and Distributed Systems
Homeostatic and Tendency-Based CPU Load Predictions

IPDPS '03 Proceedings of the 17th International Symposium on Parallel and Distributed Processing
Automatic ARIMA Time Series Modeling for Adaptive I/O Prefetching

IEEE Transactions on Parallel and Distributed Systems
Proactive Hot Spot Avoidance for Web Server Dependability

SRDS '04 Proceedings of the 23rd IEEE International Symposium on Reliable Distributed Systems
Resource Allocation for Autonomic Data Centers using Analytic Performance Models

ICAC '05 Proceedings of the Second International Conference on Automatic Computing
On the predictability of large transfer TCP throughput

Proceedings of the 2005 conference on Applications, technologies, architectures, and protocols for computer communications
Predictability of Web-Server Traffic Congestion

WCW '05 Proceedings of the 10th International Workshop on Web Content Caching and Distribution
Dynamic load balancing experiments in a grid

CCGRID '05 Proceedings of the Fifth IEEE International Symposium on Cluster Computing and the Grid (CCGrid'05) - Volume 2 - Volume 02
Dynamic estimation of CPU demand of web traffic

valuetools '06 Proceedings of the 1st international conference on Performance evaluation methodolgies and tools
Analytic modeling of multitier Internet applications

ACM Transactions on the Web (TWEB)
A scalable application placement controller for enterprise data centers

Proceedings of the 16th international conference on World Wide Web
Load shedding and distributed resource control of stream processing networks

Performance Evaluation
CPU demand for web serving: Measurement analysis and dynamic estimation

Performance Evaluation
Models and framework for supporting runtime decisions in Web-based systems

ACM Transactions on the Web (TWEB)
High-Performance Resource Allocation and Request Redirection Algorithms for Web Clusters

IEEE Transactions on Parallel and Distributed Systems
Autonomic Request Management Algorithms for Geographically Distributed Internet-Based Systems

SASO '08 Proceedings of the 2008 Second IEEE International Conference on Self-Adaptive and Self-Organizing Systems
Self-Aggregation Techniques for Load Balancing in Distributed Systems

SASO '08 Proceedings of the 2008 Second IEEE International Conference on Self-Adaptive and Self-Organizing Systems
1000 islands: an integrated approach to resource management for virtualized data centers

Cluster Computing
Short-term prediction models for server management in Internet-based contexts

Decision Support Systems
Joint admission control and resource allocation in virtualized servers

Journal of Parallel and Distributed Computing
Fiber optic communication technologies: what's needed for datacenter network operations

IEEE Communications Magazine
Flexible Distributed Capacity Allocation and Load Redirect Algorithms for Cloud Systems

CLOUD '11 Proceedings of the 2011 IEEE 4th International Conference on Cloud Computing
Energy-Aware Autonomic Resource Allocation in Multitier Virtualized Environments

IEEE Transactions on Services Computing

An Inter-cloud Outsourcing Model to Scale Performance, Availability and Security

UCC '12 Proceedings of the 2012 IEEE/ACM Fifth International Conference on Utility and Cloud Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Resource management remains one of the main issues of cloud computing providers because system resources have to be continuously allocated to handle workload fluctuations while guaranteeing Service Level Agreements (SLA) to the end users. In this paper, we propose novel capacity allocation algorithms able to coordinate multiple distributed resource controllers operating in geographically distributed cloud sites. Capacity allocation solutions are integrated with a load redirection mechanism which, when necessary, distributes incoming requests among different sites. The overall goal is to minimize the costs of allocated resources in terms of virtual machines, while guaranteeing SLA constraints expressed as a threshold on the average response time. We propose a distributed solution which integrates workload prediction and distributed non-linear optimization techniques. Experiments show how the proposed solutions improve other heuristics proposed in literature without penalizing SLAs, and our results are close to the global optimum which can be obtained by an oracle with a perfect knowledge about the future offered load.