Adapting grid applications to safety using fault-tolerant methods: Design, implementation and evaluations

  • Authors:
  • Xuanhua Shi;Jean-Louis Pazat;Eric Rodriguez;Hai Jin;Hongbo Jiang

  • Affiliations:
  • CGCL/SCTS, School Comput., Huazhong Univ. Sci. & Tech., Wuhan, 430074, China;IRISA/INSA de Rennes, Campus de Beaulieu, 35042 Rennes, France;CEDRAT, 15 Chemin de Malacher-Inovallée, 38246 MEYLAN, France;CGCL/SCTS, School Comput., Huazhong Univ. Sci. & Tech., Wuhan, 430074, China;EIE Department, Huazhong Univ. Sci. & Tech., Wuhan, 430074, China

  • Venue:
  • Future Generation Computer Systems
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Grid applications have been prone to encountering problems such as failures or malicious attacks during execution in recent years, due to their distributed and large-scale features. The application itself, however, has limited power to address these problems. This paper presents the design, implementation, and evaluation of an adaptive framework- Dynasa, which strives to handle security problems using adaptive fault-tolerance (i.e., checkpointing and replication) during the execution of applications according to the status of the Grid environments. We evaluate our adaptive framework experimentally using the Grid5000 testbed and the experimental results have demonstrated that Dynasa enables the application itself to handle the security problems efficiently. The starting of the adaptive component is less than 1 s and the adaptive action is less than 0.1 s with the checkpoint interval of 20 s. Compared with non-adaptive method, experimental results demonstrate that Dynasa achieves better performance in terms of execution time, network bandwidth consumed, and CPU load, resulting in up to a 50% lower overhead.