Grid computing is an emerging technology aimed at large-scale resource sharing and global-area collaboration. It is the next step in the evolution of parallel and distributed computing. Because grid systems are large and complex, their performance and reliability are difficult to model, analyze, and evaluate. This paper presents a model that relaxes several assumptions made in prior research on distributed systems that do not hold for grid computing. The paper proposes a virtual tree-structured model of the grid service. This model simplifies the physical structure of a grid service, allows service performance (execution time) to be evaluated efficiently, and takes into account data dependence and failure correlation. Based on the model, the paper suggests an algorithm for evaluating the grid service time distribution and the service reliability indices. The algorithm is based on graph theory and probability theory. Illustrative examples and a real case study of the BioGrid are presented.
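To make the link between a service time distribution and reliability indices concrete, the following is a minimal sketch rather than the paper's algorithm. It assumes the service time distribution is already available as a discrete probability mass function, that a failed service is represented by an infinite service time, and that reliability is read as the probability of finishing within a deadline. The function names, the `pmf` layout, and the numbers are all hypothetical.

```python
import math


def service_reliability(pmf, deadline):
    """Probability that the service finishes within `deadline`.

    `pmf` maps a service time to its probability; math.inf stands for
    a service that never completes (an assumed convention).
    """
    return sum(p for t, p in pmf.items() if t <= deadline)


def conditional_expected_time(pmf, deadline):
    """Expected service time given that the service meets the deadline."""
    r = service_reliability(pmf, deadline)
    if r == 0:
        return math.inf  # the deadline is never met
    return sum(t * p for t, p in pmf.items() if t <= deadline) / r


# Hypothetical distribution: 10 s with prob. 0.5, 20 s with prob. 0.3,
# and failure (infinite time) with prob. 0.2.
pmf = {10.0: 0.5, 20.0: 0.3, math.inf: 0.2}

if __name__ == "__main__":
    print(service_reliability(pmf, 15.0))
    print(service_reliability(pmf, 20.0))
    print(conditional_expected_time(pmf, 20.0))
```

Under these assumptions, tightening the deadline can only lower the reliability value, which is why the time distribution and the reliability indices are naturally evaluated together.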