Software rejuvenation in the cloud

Authors:
Dario Bruneo;Francesco Longo;Antonio Puliafito;Marco Scarpa;Salvatore Distefano
Affiliations:
Università degli Studi di Messina, Messina (ME), Italia;Università degli Studi di Messina, Messina (ME), Italia;Università degli Studi di Messina, Messina (ME), Italia;Università degli Studi di Messina, Messina (ME), Italia;Politecnico di Milano, Milano (MI), Italia
Venue:
Proceedings of the 5th International ICST Conference on Simulation Tools and Techniques
Year:
2012

Citing 17
Cited 0

Reliability engineering handbook (vol. 1)

Reliability engineering handbook (vol. 1)
Analysis of Preventive Maintenance in Transactions Based Software Systems

IEEE Transactions on Computers
A Methodology for Detection and Estimation of Software Aging

ISSRE '98 Proceedings of the The Ninth International Symposium on Software Reliability Engineering
A Measurement-Based Model for Estimation of Resource Exhaustion in Operational Software Systems

ISSRE '99 Proceedings of the 10th International Symposium on Software Reliability Engineering
A Comprehensive Model for Software Rejuvenation

IEEE Transactions on Dependable and Secure Computing
Scientific Cloud Computing: Early Definition and Experience

HPCC '08 Proceedings of the 2008 10th IEEE International Conference on High Performance Computing and Communications
Applying Symbolic Techniques to the Representation of Non-Markovian Models with Continuous PH Distributions

EPEW '09 Proceedings of the 6th European Performance Engineering Workshop on Computer Performance Engineering
Performance Evaluation of Cloud Computing Offerings

ADVCOMP '09 Proceedings of the 2009 Third International Conference on Advanced Engineering Computing and Applications in Sciences
Efficient resource provisioning in compute clouds via VM multiplexing

Proceedings of the 7th international conference on Autonomic computing
Symbolic Representation Techniques in Dynamic Reliability Evaluation

HASE '10 Proceedings of the 2010 IEEE 12th International Symposium on High-Assurance Systems Engineering
Availability Assessment of HA Standby Redundant Clusters

SRDS '10 Proceedings of the 2010 29th IEEE Symposium on Reliable Distributed Systems
End-to-End Performability Analysis for Infrastructure-as-a-Service Cloud: An Interacting Stochastic Models Approach

PRDC '10 Proceedings of the 2010 IEEE 16th Pacific Rim International Symposium on Dependable Computing
Reservoir - When One Cloud Is Not Enough

Computer
On the Performance Variability of Production Cloud Services

CCGRID '11 Proceedings of the 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing
Live Virtual Machine Migration via Asynchronous Replication and State Synchronization

IEEE Transactions on Parallel and Distributed Systems
Performance Analysis of Cloud Computing Centers Using M/G/m/m+r Queuing Systems

IEEE Transactions on Parallel and Distributed Systems
SP 800-145. The NIST Definition of Cloud Computing

SP 800-145. The NIST Definition of Cloud Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we investigate how software rejuvenation can be used in a Cloud environment to increase the availability of a virtualized system composed of a single virtual machine monitor (VMM) on top of which a certain number of virtual machines (VMs) can be instantiated. We start from the assumption that the aging of a VMM increases with the number of VMs it is managing, thus characterizing the problem in terms of dynamic reliability. Therefore, by identifying the age of the VMM with its reliability and based on the conservation of reliability principle, we characterize the time to failure of the VMM through continuous phase type distributions. The system availability is thus modeled by an expanded continuous time Markov chain expressed in terms of Kronecker algebra in order to face the state space explosion and to keep memory of the age reached by the VMM in case the number of the hosted VMs change. Time-based rejuvenation is taken into consideration and the optimal timer is evaluated in order to maximize the VMM availability.