Introduction to algorithms
Task Allocation for Maximizing Reliability of Distributed Computer Systems
IEEE Transactions on Computers
Task Allocation Algorithms for Maximizing Reliability of Distributed Computing Systems
IEEE Transactions on Computers
Journal of Parallel and Distributed Computing - Special issue on parallel evolutionary computing
Generational scheduling for dynamic task management in heterogeneous computing systems
Information Sciences: an International Journal - special issue on parallel and distributed processing
IEEE Transactions on Computers
Future Generation Computer Systems - Special issue on metacomputing
IEEE Transactions on Parallel and Distributed Systems
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Experimental Assessment of Workstation Failures and Their Impact on Checkpointing Systems
FTCS '98 Proceedings of the The Twenty-Eighth Annual International Symposium on Fault-Tolerant Computing
A Dynamic Matching and Scheduling Algorithm for Heterogeneous Computing Systems
HCW '98 Proceedings of the Seventh Heterogeneous Computing Workshop
Dynamic, Competitive Scheduling of Multiple DAGs in a Distributed Heterogeneous Environment
HCW '98 Proceedings of the Seventh Heterogeneous Computing Workshop
ICPP '00 Proceedings of the 2000 International Workshop on Parallel Processing
Dynamic matching and scheduling algorithms for a multiuser heterogeneous computing environment
Dynamic matching and scheduling algorithms for a multiuser heterogeneous computing environment
Security-Driven Heuristics and A Fast Genetic Algorithm for Trusted Grid Job Scheduling
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
Risk-Resilient Heuristics and Genetic Algorithms for Security-Assured Grid Job Scheduling
IEEE Transactions on Computers
Future Generation Computer Systems
Bi-objective scheduling algorithms for optimizing makespan and reliability on heterogeneous systems
Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures
Cross-Layer Collaborative In-Network Processing in Multihop Wireless Sensor Networks
IEEE Transactions on Mobile Computing
Energy minimization with loop fusion and multi-functional-unit scheduling for multidimensional DSP
Journal of Parallel and Distributed Computing
CHINA HPC '07 Proceedings of the 2007 Asian technology information program's (ATIP's) 3rd workshop on High performance computing in China: solution approaches to impediments for high performance computing
The Journal of Supercomputing
Euro-Par '08 Proceedings of the 14th international Euro-Par conference on Parallel Processing
Scheduling in a dynamic heterogeneous distributed system using estimation error
Journal of Parallel and Distributed Computing
Cost minimization while satisfying hard/soft timing constraints for heterogeneous embedded systems
ACM Transactions on Design Automation of Electronic Systems (TODAES)
Reliability versus performance for critical applications
Journal of Parallel and Distributed Computing
Analyzing scheduling with transient failures
Information Processing Letters
Loop scheduling and bank type assignment for heterogeneous multi-bank memory
Journal of Parallel and Distributed Computing
Energy minimization for heterogeneous wireless sensor networks
Journal of Embedded Computing - Design and Optimization for High Performance Embedded Systems
Reliability-Oriented Genetic Algorithm for Workflow Applications Using Max-Min Strategy
CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
Energy-Aware Loop Scheduling and Assignment for Multi-Core, Multi-Functional-Unit Architecture
Journal of Signal Processing Systems
A memetic algorithm for reliability-based dynamic scheduling in heterogeneous computing environments
PDCS '07 Proceedings of the 19th IASTED International Conference on Parallel and Distributed Computing and Systems
International Journal of Knowledge-based and Intelligent Engineering Systems
Journal of Parallel and Distributed Computing
Robust task scheduling for volunteer computing systems
The Journal of Supercomputing
Reliability-aware scheduling strategy for heterogeneous distributed computing systems
Journal of Parallel and Distributed Computing
Online energy-saving algorithm for sensor networks in dynamic changing environments
Journal of Embedded Computing
Lifetime reliability-aware task allocation and scheduling for MPSoC platforms
Proceedings of the Conference on Design, Automation and Test in Europe
A task remapping technique for reliable multi-core embedded systems
CODES/ISSS '10 Proceedings of the eighth IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
Failure-aware workflow scheduling in cluster environments
Cluster Computing
Job-scheduling via resource availability prediction for volunteer computational grids
International Journal of Grid and Utility Computing
Future Generation Computer Systems
Tradeoff exploration between reliability, power consumption, and execution time
SAFECOMP'11 Proceedings of the 30th international conference on Computer safety, reliability, and security
Cloud-DLS: Dynamic trusted scheduling for Cloud computing
Expert Systems with Applications: An International Journal
Comparison of Hyper-DAG based task mapping and scheduling heuristics for wireless sensor networks
ISCIS'05 Proceedings of the 20th international conference on Computer and Information Sciences
Journal of Parallel and Distributed Computing
Online optimization for scheduling preemptable tasks on IaaS cloud systems
Journal of Parallel and Distributed Computing
Journal of Signal Processing Systems
Bayesian Cognitive Model in Scheduling Algorithm for Data Intensive Computing
Journal of Grid Computing
Dynamic trust evaluation and scheduling framework for cloud computing
Security and Communication Networks
Reliability and performance optimization of pipelined real-time systems
Journal of Parallel and Distributed Computing
Distributed workflow mapping algorithm for maximized reliability under end-to-end delay constraint
The Journal of Supercomputing
Load balanced reliable task scheduling algorithm for heterogeneous systems
Journal of High Speed Networks
Hi-index | 0.01 |
In a heterogeneous distributed computing system, machine and network failures are inevitable and can have an adverse effect on applications executing on the system. To reduce the effect of failures on an application executing on a failure-prone system, matching and scheduling algorithms which minimize not only the execution time but also the probability of failure of the application must be devised. However, because of the conflicting requirements, it is not possible to minimize both of the objectives at the same time. Thus, the goal of this paper is to develop matching and scheduling algorithms which account for both the execution time and the reliability of the application. This goal is achieved by modifying an existing matching and scheduling algorithm. The reliability of resources is taken into account using an incremental cost function proposed in this paper and the new algorithm is referred to as the reliable dynamic level scheduling algorithm. The incremental cost function can be defined based on one of the three cost functions developed here. These cost functions are unique in the sense that they are not restricted to tree-based networks and a specific matching and scheduling algorithm. The simulation results confirm that the proposed incremental cost function can be incorporated into matching and scheduling algorithms to produce schedules where the effect of failures of machines and network resources on the execution of the application is reduced and the execution time of the application is minimized as well.