A Generalized Analytic Performance Model of Distributed Systems that Perform N Tasks Using P Fault-Prone Processors

Authors:
Gehan Weerasinghe;Imad Antonios;Lester Lipsky
Affiliations:
-;-;-
Venue:
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Year:
2002

Citing 16
Cited 2

Analysis of a composite performance reliability measure for fault-tolerant systems

Journal of the ACM (JACM)
Hierarchical Modeling of Availability in Distributed Systems

IEEE Transactions on Software Engineering
Supporting Fault-Tolerant Parallel Programming in Linda

IEEE Transactions on Parallel and Distributed Systems
Determining Redundancy Levels for Fault Tolerant Real-Time Systems

IEEE Transactions on Computers - Special issue on fault-tolerant computing
Performability Analysis: A New Algorithm

IEEE Transactions on Computers
Performance and reliability analysis of computer systems: an example-based approach using the SHARPE software package

Performance and reliability analysis of computer systems: an example-based approach using the SHARPE software package
In search of clusters (2nd ed.)

In search of clusters (2nd ed.)
High Performance Cluster Computing: Architectures and Systems

High Performance Cluster Computing: Architectures and Systems
Probability and Statistics with Reliability, Queuing and Computer Science Applications

Probability and Statistics with Reliability, Queuing and Computer Science Applications
Fault-Tolerant Parallel Computation

Fault-Tolerant Parallel Computation
Metacomputing with MILAN

HCW '99 Proceedings of the Eighth Heterogeneous Computing Workshop
Adaptive fault tolerance and graceful degradation under dynamic hard real-time scheduling

RTSS '97 Proceedings of the 18th IEEE Real-Time Systems Symposium
Value-Driven Resource Assignment in Object-Oriented Real-Time Dependable Systems

WORDS '97 Proceedings of the 3rd Workshop on Object-Oriented Real-Time Dependable Systems - (WORDS '97)
On Markov Reward Modeling with FSPNs

IPDS '00 Proceedings of the 4th International Computer Performance and Dependability Symposium
An Analytic Performance Model of Parallel Systems that Perform N Tasks Using P Processors That Can Fail

NCA '01 Proceedings of the IEEE International Symposium on Network Computing and Applications (NCA'01)
On Evaluating the Performability of Degradable Computing Systems

IEEE Transactions on Computers

Genetic Algorithm Based Scheduling of Meta-Tasks with Stochastic Execution Times in Heterogeneous Computing Systemst1

Cluster Computing
Dynamic resource allocation of computer clusters with probabilistic workloads

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

A family of Markov models for analyzing the performance of parallel processors that execute a job consisting of N independent tasks using P fault-prone processors is presented in this paper. This study extends our previous study by allowing idle processors to fail, and also by developing performance models to analyze the case where one processor is fail-safe. The models are based on Markov Chains with states representing service, and failure rates with k (0