Performance evaluation of parallel systems employing roll-forward checkpoint schemes

  • Authors:
  • Gyung-Leen Park;Hee Yong Youn;Junghoon Lee;Chul Soo Kim;Bongkyu Lee;Sang Joon Lee;Wang-Cheol Song;Yung-Cheol Byun

  • Affiliations:
  • Dept. of Computer Science and Statistics, Cheju National University, Cheju, Korea;School of Information and Communications Engineering, Sungkyunkwan University, Suwon, Korea;Dept. of Computer Science and Statistics, Cheju National University, Cheju, Korea;Dept. of Computer Science and Statistics, Cheju National University, Cheju, Korea;Dept. of Computer Science and Statistics, Cheju National University, Cheju, Korea;Faculty of Telecommunication and Computer Engineering, Cheju National University, Cheju, Korea;Faculty of Telecommunication and Computer Engineering, Cheju National University, Cheju, Korea;Faculty of Telecommunication and Computer Engineering, Cheju National University, Cheju, Korea

  • Venue:
  • ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part V
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

High performance and reliability are the main goals of parallel and distributed computing systems. To increase the performance and reliability of the systems, various checkpoint schemes have been proposed in the literature for decades. However, the lack of general analytical models has been an obstacle to compare the performance of systems employing different checkpoint schemes. This paper develops an analytical model to evaluate the relative response time of systems employing checkpoint schemes. The model has been applied to evaluate the relative response time of systems employing RFC (Roll-Forward Checkpoint), DMR-F (Double Modular Redundancy for Forward recovery), and DST (Duplex with Self-Test) schemes. The result shows the feasibility of the model developed in the paper.