SYREL: A Symbolic Reliability Algorithm Based on Path and Cutset Methods
IEEE Transactions on Computers
Hierarchical Modeling of Availability in Distributed Systems
IEEE Transactions on Software Engineering
Distributed functions allocation for reliability and delay optimization
ACM '86 Proceedings of 1986 ACM Fall joint computer conference
Management of Value-Added Services in theTrading-Based System AGORA
Journal of Network and Systems Management
Reliability Analysis in Distributed Systems
IEEE Transactions on Computers
On Distributed Computing Systems Reliability Analysis Under Program Execution Constraints
IEEE Transactions on Computers
CAREL: Computer Aided Reliability Evaluator for Distributed Computing Networks
IEEE Transactions on Parallel and Distributed Systems
Reliability Analysis of Distributed Systems Based on a Fast Reliability Algorithm
IEEE Transactions on Parallel and Distributed Systems
Dependability Modeling and Analysis of Distributed Programs
IEEE Transactions on Software Engineering
Software reliability engineering for client-server systems
ISSRE '96 Proceedings of the The Seventh International Symposium on Software Reliability Engineering
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 15 - Volume 16
Reliability of grid service systems
Computers and Industrial Engineering
Journal of Systems and Software
Optimal task partition and distribution in grid service system with common cause failures
Future Generation Computer Systems - Special section: Information engineering and enterprise architecture in distributed computing environments
A Hierarchical Modeling and Analysis for Grid Service Reliability
IEEE Transactions on Computers
IEEE Transactions on Computers
Computers and Industrial Engineering
Reliability of grid service systems
Computers and Industrial Engineering
CIS'04 Proceedings of the First international conference on Computational and Information Science
A fast algorithm for reliability-oriented task assignment in a distributed system
Computer Communications
Hi-index | 0.02 |
The reliability of distributed processing systems can be expressed in terms of the reliability of the processing elements that run the programs, the reliability of the processing elements holding the required files, and the reliability of the communication links used in file transfers. We introduce two reliability measures, namely, distributed program reliability and distributed system reliability to accurately model the reliability of distributed systems. The first measure describes the probability of successful execution of a distributed program which runs on some processing elements and needs to communicate with other processing elements for remote files, while the second measure describes the probability that all the programs of a given set can run successfully. The notion of minimal file spanning trees is introduced to efficiently evaluate these reliability measures. Graph theory techniques are used to systematically generate file spanning trees that provide all the required connections. Our technique is general and can be used in a dynamic environment for efficient reliability evaluation.