Analysis of a composite performance reliability measure for fault-tolerant systems
Journal of the ACM (JACM)
Hierarchical Modeling of Availability in Distributed Systems
IEEE Transactions on Software Engineering
Supporting Fault-Tolerant Parallel Programming in Linda
IEEE Transactions on Parallel and Distributed Systems
Determining Redundancy Levels for Fault Tolerant Real-Time Systems
IEEE Transactions on Computers - Special issue on fault-tolerant computing
Performability Analysis: A New Algorithm
IEEE Transactions on Computers
Performance and reliability analysis of computer systems: an example-based approach using the SHARPE software package
In search of clusters (2nd ed.)
In search of clusters (2nd ed.)
High Performance Cluster Computing: Architectures and Systems
High Performance Cluster Computing: Architectures and Systems
Probability and Statistics with Reliability, Queuing and Computer Science Applications
Probability and Statistics with Reliability, Queuing and Computer Science Applications
Fault-Tolerant Parallel Computation
Fault-Tolerant Parallel Computation
HCW '99 Proceedings of the Eighth Heterogeneous Computing Workshop
Adaptive fault tolerance and graceful degradation under dynamic hard real-time scheduling
RTSS '97 Proceedings of the 18th IEEE Real-Time Systems Symposium
Value-Driven Resource Assignment in Object-Oriented Real-Time Dependable Systems
WORDS '97 Proceedings of the 3rd Workshop on Object-Oriented Real-Time Dependable Systems - (WORDS '97)
On Markov Reward Modeling with FSPNs
IPDS '00 Proceedings of the 4th International Computer Performance and Dependability Symposium
NCA '01 Proceedings of the IEEE International Symposium on Network Computing and Applications (NCA'01)
On Evaluating the Performability of Degradable Computing Systems
IEEE Transactions on Computers
Dynamic resource allocation of computer clusters with probabilistic workloads
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Hi-index | 0.00 |
A family of Markov models for analyzing the performance of parallel processors that execute a job consisting of N independent tasks using P fault-prone processors is presented in this paper. This study extends our previous study by allowing idle processors to fail, and also by developing performance models to analyze the case where one processor is fail-safe. The models are based on Markov Chains with states representing service, and failure rates with k (0