Evolutionary algorithms, including genetic programming (GP), are frequently employed to solve difficult real-life problems, which can require days or even months of computation. One approach to reducing the time-to-solution is to use parallel computing on distributed platforms. Such large platforms are prone to failures, which can be commonplace events rather than rare occurrences. Thus, fault tolerance and recovery techniques are typically necessary. The aim of this article is to show the inherent ability of parallel GP to tolerate failures in distributed platforms without using any fault-tolerance technique. This ability is quantified, for two well-known problems, via simulation experiments that use failure traces from real-world distributed platforms, namely desktop grids.