A hybrid fault tolerance technique in grid computing system

  • Authors:
  • Kalim Qureshi;Fiaz Gul Khan;Paul Manuel;Babar Nazir

  • Affiliations:
  • Information Science Dept., Kuwait University, Kuwait City, Kuwait;COMSATS Institute of Information Technology, Abbottabad, Pakistan;Information Science Dept., Kuwait University, Kuwait City, Kuwait;COMSATS Institute of Information Technology, Abbottabad, Pakistan

  • Venue:
  • The Journal of Supercomputing
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In order to achieve high level of reliability and availability, the grid infrastructure should be a foolproof fault tolerant. Fault tolerance plays a key role in order to assert availability and reliability of a grid system. Since the failure of resources affects job execution fatally, fault tolerance service is essential to satisfy QoS requirement in grid computing.In this paper we proposed two hybrid fault tolerance techniques (FTTs) that are called alternate task with checkpoint and alternate task with retry. These proposed hybrid FTTs inherit the good features and overcome the limitations of workflow level FTT and task level FTT. We evaluate the performance of our proposed FTTs under different experimental environments. Finally, we conclude that alternate task with checkpoint improves the reliability of a grid system more significantly than alternate task with retry.