Bandwidth availability of multiple-bus multiprocessors
IEEE Transactions on Computers
Survey of software tools for evaluating reliability, availability, and serviceability
ACM Computing Surveys (CSUR)
Sensitivity analysis of reliability and performability measures for multiprocessor systems
SIGMETRICS '88 Proceedings of the 1988 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Interconnection Networks for Parallel and Distributed Processing
Interconnection Networks for Parallel and Distributed Processing
SIDECAR: design support for reliability
DAC '91 Proceedings of the 28th ACM/IEEE Design Automation Conference
An annotated bibliography of dependable distributed computing
ACM SIGOPS Operating Systems Review
Fault-tolerant task management and load re-distribution on massively parallel hypercube systems
Proceedings of the 1992 ACM/IEEE conference on Supercomputing
Improving the dependability of network management systems
International Journal of Network Management
Hi-index | 4.10 |
A tutorial on dependability and performance-related dependability models for multiprocessors is presented. Multiprocessors are classified as having shared-memory or distributed-memory architectures, and some fundamental dependability modeling concepts. Reliability models based on four types of reliability evaluation techniques (terminal, multiterminal, task-based, and network reliability) are examined. The status of research efforts on performance-related dependability is discussed, and the models' effectiveness is illustrated with a few numerical examples. A brief survey of software packages for dependability computation in included.