RRBS: a fault tolerance model for cluster/grid parallel file system

  • Authors:
  • Yan-mei Huo;Jiu-bin Ju;Liang Hu

  • Affiliations:
  • Department of Computer Science & Technology, Jilin University, Changchun, China;Department of Computer Science & Technology, Jilin University, Changchun, China;Department of Computer Science & Technology, Jilin University, Changchun, China

  • Venue:
  • ISPA'05 Proceedings of the Third international conference on Parallel and Distributed Processing and Applications
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Parallel file systems stripe the data from a single file across multiple cluster/grid nodes so that the systems can access file in parallel. In such a system, if an I/O node or the storage device of that node doesn’t work, all the subfiles on the node can’t be accessed. In this paper, we introduce a special fault tolerance model for parallel file systems called Round-robin Redundant Backup of Subfile (RRBS). This model ensures the accessibility of the parallel files even when an I/O node is failure. In order to test the usability of RRBS, we also developed a prototype of parallel file system called WPFS on a PC/Windows cluster.