Grid result checking

Authors:
Cécile Germain-Renaud;Dephine Monnier-Ragaigne
Affiliations:
Laboratoire de Recherche en Informatique;Laboratoire de l'Accélérateur Linéaire
Venue:
Proceedings of the 2nd conference on Computing frontiers
Year:
2005

Citing 13
Cited 3

Designing programs that check their work

STOC '89 Proceedings of the twenty-first annual ACM symposium on Theory of computing
Software reliability via run-time result-checking

Journal of the ACM (JACM)
Property testing and its connection to learning and approximation

Journal of the ACM (JACM)
SETI@HOME—massively distributed computing for SETI

Computing in Science and Engineering
Sabotage-tolerance mechanisms for volunteer computing systems

Future Generation Computer Systems - Best papers from symp. on cluster computing and the grid (CCGRID 2001)
Grid Services for Distributed System Integration

Computer
Global Computing Systems

LSSC '01 Proceedings of the Third International Conference on Large-Scale Scientific Computing-Revised Papers
Uncheatable Distributed Computations

CT-RSA 2001 Proceedings of the 2001 Conference on Topics in Cryptology: The Cryptographer's Track at RSA
Result checking in global computing systems

ICS '03 Proceedings of the 17th annual international conference on Supercomputing
XtremWeb: A Generic Global Computing System

CCGRID '01 Proceedings of the 1st International Symposium on Cluster Computing and the Grid
Hardening Functions for Large Scale Distributed Computations

SP '03 Proceedings of the 2003 IEEE Symposium on Security and Privacy
Securing distributed computing against the hostile host

ACSC '04 Proceedings of the 27th Australasian conference on Computer science - Volume 26
The Anatomy of the Grid: Enabling Scalable Virtual Organizations

International Journal of High Performance Computing Applications

Multi-scale Real-Time Grid Monitoring with Job Stream Mining

CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
Group-based adaptive result certification mechanism in Desktop Grids

Future Generation Computer Systems
Convergence analysis of evolutionary algorithms in the presence of crash-faults and cheaters

Computers & Mathematics with Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Result checking is the theory and practice of proving that the result of an execution of a program on an input is correct. Result checking has most often been envisioned in the framework of program testing or property testing, where the issue is the conformity of the program to some a-priori specification. Very large scale distributed computing systems demand to tackle the issue of computation correctness, albeit from hypothesis very different from the program testing ones. The general issues examined in this paper are the following. First, the definition of checking methods adapted to large-scale Monte-Carlo simulations; for these applications, no external criterion can be used to assess the quality of the result. Second, two result checking algorithms which minimize the overall overhead through an adaptive strategy. Finally, a specialization of this framework to a case study, the Auger astrophysics experiment. Our main contributions are: first to focus on checking Monte-Carlo simulations, which have rarely been considered previously; second to define a probabilistic checking strategy including the risk of first kind (false positive) as well as the risk of second kind (false negative) which is usually the only one considered, and which is compatible with Byzantine saboteurs; third, to exploit the probable characteristics of the behaviour of the saboteurs to optimise for the most frequent case. Finally, we show on a case study that the implementation details can be carried out