A partitioning algorithm for distributed software systems design
Information Sciences: an International Journal
Task Allocation and Precedence Relations for Distributed Real-Time Systems
IEEE Transactions on Computers
The MAFT Architecture for Distributed Fault Tolerance
IEEE Transactions on Computers - Fault-Tolerant Computing
Distributed functions allocation for reliability and delay optimization
ACM '86 Proceedings of 1986 ACM Fall joint computer conference
Task assignment in distributed systems
Task assignment in distributed systems
Problem-Solving Methods in Artificial Intelligence
Problem-Solving Methods in Artificial Intelligence
Replicated Process Allocation for Load Distribution in Fault-Tolerant Multicomputers
IEEE Transactions on Computers
Task Allocation Algorithms for Maximizing Reliability of Distributed Computing Systems
IEEE Transactions on Computers
IEEE Transactions on Computers
Safety and Reliability Driven Task Allocation in Distributed Systems
IEEE Transactions on Parallel and Distributed Systems
IEEE Transactions on Computers
IEEE Transactions on Parallel and Distributed Systems
ICPP '97 Proceedings of the international Conference on Parallel Processing
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Reliability and cost optimization in distributed computing systems
Computers and Operations Research
ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
Process Allocation for Load Distribution in Fault-Tolerant Multicomputers
FTCS '95 Proceedings of the Twenty-Fifth International Symposium on Fault-Tolerant Computing
Efficient Assignment and Scheduling for Heterogeneous DSP Systems
IEEE Transactions on Parallel and Distributed Systems
Task allocation for maximizing reliability of distributed systems: a simulated annealing approach
Journal of Parallel and Distributed Computing
Journal of Systems and Software
Cost minimization while satisfying hard/soft timing constraints for heterogeneous embedded systems
ACM Transactions on Design Automation of Electronic Systems (TODAES)
Reliability versus performance for critical applications
Journal of Parallel and Distributed Computing
Analyzing scheduling with transient failures
Information Processing Letters
Computers and Industrial Engineering
Proceedings of the 2009 workshop on Resiliency in high performance
The Journal of Supercomputing
Performance under Failures of DAG-based Parallel Computing
CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
Scheduling on the Grid via multi-state resource availability prediction
GRID '08 Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing
A memetic algorithm for reliability-based dynamic scheduling in heterogeneous computing environments
PDCS '07 Proceedings of the 19th IASTED International Conference on Parallel and Distributed Computing and Systems
Future Generation Computer Systems
The decision model of task allocation for constrained stochastic distributed systems
Computers and Industrial Engineering
NPC'07 Proceedings of the 2007 IFIP international conference on Network and parallel computing
Reliability-aware scheduling strategy for heterogeneous distributed computing systems
Journal of Parallel and Distributed Computing
Availability Prediction Based Replication Strategies for Grid Environments
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Journal of Systems and Software
Lifetime reliability-aware task allocation and scheduling for MPSoC platforms
Proceedings of the Conference on Design, Automation and Test in Europe
A task remapping technique for reliable multi-core embedded systems
CODES/ISSS '10 Proceedings of the eighth IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
A probabilistic approach for fault tolerant multiprocessor real-time scheduling
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Reliability-driven deployment optimization for embedded systems
Journal of Systems and Software
Task assignment in heterogeneous computing systems using an effective iterated greedy algorithm
Journal of Systems and Software
Cloud-DLS: Dynamic trusted scheduling for Cloud computing
Expert Systems with Applications: An International Journal
Journal of Parallel and Distributed Computing
Satisfaction-based query replication
Distributed and Parallel Databases
Performance implications of failures in large-scale cluster scheduling
JSSPP'04 Proceedings of the 10th international conference on Job Scheduling Strategies for Parallel Processing
A novel task scheduling for heterogeneous systems
ICESS'04 Proceedings of the First international conference on Embedded Software and Systems
Architecture-Driven reliability and energy optimization for complex embedded systems
QoSA'10 Proceedings of the 6th international conference on Quality of Software Architectures: research into Practice - Reality and Gaps
Bayesian Cognitive Model in Scheduling Algorithm for Data Intensive Computing
Journal of Grid Computing
An enhanced DGIDE platform for intrusion detection
ATC'07 Proceedings of the 4th international conference on Autonomic and Trusted Computing
Maximal profit service task partition and distribution in computer grid
Computers and Industrial Engineering
Journal of Parallel and Distributed Computing
Quantitative reactive modeling and verification
Computer Science - Research and Development
Hi-index | 14.99 |
For distributed systems, system reliability is defined as the probability that the system can run an entire task successfully. When the system's hardware configuration is fixed, the system reliability is mainly dependent on the software design. The task allocation problem is addressed with the goal of maximizing the system reliability. A quantitative problem model, algorithms for optimal and suboptimal solutions, and simulation results are provided and discussed.