Lightweight logging and recovery for distributed shared memory over virtual interface architecture

  • Authors:
  • Soyeon Park;Youngjae Kim;Seung Ryoul Maeng

  • Affiliations:
  • Department of Electrical Engineering and Computer Science, Korea Advanced Institute of Science and Technology;Department of Electrical Engineering and Computer Science, Korea Advanced Institute of Science and Technology;Department of Electrical Engineering and Computer Science, Korea Advanced Institute of Science and Technology

  • Venue:
  • ISPDC'03 Proceedings of the Second international conference on Parallel and distributed computing
  • Year:
  • 2003

Quantified Score

Hi-index 0.01

Visualization

Abstract

As software Distributed Shared Memory(DSM) systems become attractive on larger clusters, the focus of attention moves toward improving the reliability of systems. In this paper, we propose a lightweight logging scheme, called remote logging, and a recovery protocol for home-based DSM. Remote logging stores coherence-related data to the volatile memory of a remote node. The logging overhead can be moderated with high-speed system area network and user-level DMA operations supported by modern communication protocols. Remote logging tolerates multiple failures if the backup nodes of failed nodes are alive. It makes the reliability of DSM grow much higher. Experimental results show that our fault-tolerant DSM has low overhead compared to conventional stable logging and it can be effectively recovered from some concurrent failures.