Distributed program reliability analysis
IEEE Transactions on Software Engineering
High-throughput resource management
The grid
Chameleon: A Software Infrastructure for Adaptive Fault Tolerance
IEEE Transactions on Parallel and Distributed Systems
The distributed program reliability analysis on star topologies
Computers and Operations Research
An Efficient SuperGrid Protocol for High Availability and Load Balancing
IEEE Transactions on Computers
Probability and statistics with reliability, queuing and computer science applications
Probability and statistics with reliability, queuing and computer science applications
Parallel Processing of Adaptive Meshes with Load Balancing
IEEE Transactions on Parallel and Distributed Systems
Software Reliability
A taxonomy and survey of grid resource management systems for distributed computing
Software—Practice & Experience
Reliability Analysis of Distributed Systems Based on a Fast Reliability Algorithm
IEEE Transactions on Parallel and Distributed Systems
A Unified Task-Based Dependability Model for Hypercube Computers
IEEE Transactions on Parallel and Distributed Systems
Almost Certain Fault Diagnosis Through Algorithm-Based Fault Tolerance
IEEE Transactions on Parallel and Distributed Systems
A computational economy for grid computing and its implementation in the Nimrod-G resource broker
Future Generation Computer Systems - Grid computing: Towards a new computing infrastructure
Adaptive Computing on the Grid Using AppLeS
IEEE Transactions on Parallel and Distributed Systems
Reliability Analysis of Grid Computing Systems
PRDC '02 Proceedings of the 2002 Pacific Rim International Symposium on Dependable Computing
The Grid 2: Blueprint for a New Computing Infrastructure
The Grid 2: Blueprint for a New Computing Infrastructure
Computing System Reliability: Models And Analysis
Computing System Reliability: Models And Analysis
The Anatomy of the Grid: Enabling Scalable Virtual Organizations
International Journal of High Performance Computing Applications
ARMS: An agent-based resource management system for grid computing
Scientific Programming
Identity-Based Authentication for Cloud Computing
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
A hierarchical reliability-driven scheduling algorithm in grid systems
Journal of Parallel and Distributed Computing
Ordinal optimization based approach to the optimal resource allocation of grid computing system
Mathematical and Computer Modelling: An International Journal
Cloud Computing Towards Technological Convergence
International Journal of Cloud Applications and Computing
A survey on reliability in distributed systems
Journal of Computer and System Sciences
Performance evaluation of cloud service considering fault recovery
The Journal of Supercomputing
Bionic Autonomic Nervous Systems for Self-Defense against DoS, Spyware, Malware, Virus, and Fishing
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Performance modelling and analysis of mobile grid computing systems
International Journal of Grid and Utility Computing
Journal of High Speed Networks
Hi-index | 14.98 |
Grid computing is a recently developed technology. Although the developmental tools and techniques for the grid have been extensively studied, grid reliability analysis is not easy because of its complexity. This paper is the first one that presents a hierarchical model for the grid service reliability analysis and evaluation. The hierarchical modeling is mapped to the physical and logical architecture of the grid service system and makes the evaluation and calculation tractable by identifying the independence among layers. Various types of failures are interleaved in the grid computing environment, such as blocking failures, time-out failures, matchmaking failures, network failures, program failures, and resource failures. This paper investigates all of them to achieve a complete picture about grid service reliability. Markov models, Queuing theory, and Graph theory are mainly used here to model, evaluate, and analyze the grid service reliability. Numerical examples are illustrated.