Performance of memory reclamation for lockless synchronization

  • Authors: Thomas E. Hart; Paul E. McKenney; Angela Demke Brown; Jonathan Walpole

  • Affiliations: Department of Computer Science, University of Toronto, Toronto, Ont., Canada M5S 2E4; IBM Linux Technology Center, IBM Beaverton, Beaverton, OR 97006, USA; Department of Computer Science, University of Toronto, Toronto, Ont., Canada M5S 2E4; Department of Computer Science, Portland State University, Portland, OR 97207-0751, USA

  • Venue: Journal of Parallel and Distributed Computing
  • Year: 2007

Abstract

Achieving high performance for concurrent applications on modern multiprocessors remains challenging. Many programmers avoid locking to improve performance, while others replace locks with non-blocking synchronization to protect against deadlock, priority inversion, and convoying. In both cases, dynamic data structures that avoid locking require a memory reclamation scheme that reclaims elements once they are no longer in use. The performance of existing memory reclamation schemes has not been thoroughly evaluated. We conduct the first fair and comprehensive comparison of three recent schemes (quiescent-state-based reclamation, epoch-based reclamation, and hazard-pointer-based reclamation) using a flexible microbenchmark. Our results show that there is no globally optimal scheme. When evaluating lockless synchronization, programmers and algorithm designers should thus carefully consider the data structure, the workload, and the execution environment, each of which can dramatically affect the memory reclamation performance. We discuss the consequences of our results for programmers and algorithm designers. Finally, we describe the use of one scheme, quiescent-state-based reclamation, in the context of an OS kernel, an execution environment which is well suited to this scheme.
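
To make the reclamation problem in the abstract concrete, here is a minimal, single-threaded simulation of the general idea behind quiescent-state-based reclamation: an element removed from a shared structure is only freed after every thread has announced that it holds no references. This is an illustrative sketch, not the authors' implementation. The class `QsbrDomain`, its method names, and the conservative two-epoch rule are assumptions chosen for the example, and no real concurrency or memory management is involved.

```python
class QsbrDomain:
    """Single-threaded simulation of quiescent-state-based reclamation.

    Hypothetical names throughout; a real implementation needs atomic
    operations and memory barriers, which this sketch leaves out.
    """

    def __init__(self, n_threads):
        self.global_epoch = 0
        # Epoch in which each thread last announced a quiescent state;
        # -1 means the thread has not announced one yet.
        self.seen = [-1] * n_threads
        self.limbo = []   # (retire_epoch, element) pairs awaiting reclamation
        self.freed = []   # stands in for returning memory to the allocator

    def retire(self, element):
        """Call after unlinking an element from the shared structure."""
        self.limbo.append((self.global_epoch, element))

    def quiescent_state(self, tid):
        """Thread `tid` declares that it holds no references to shared elements."""
        self.seen[tid] = self.global_epoch
        # The epoch advances only once every thread has announced in it.
        if all(epoch == self.global_epoch for epoch in self.seen):
            self.global_epoch += 1
            self._reclaim()

    def _reclaim(self):
        # Conservative rule (an assumption of this sketch): an element retired
        # in epoch e is freed once the global epoch reaches e + 2, so every
        # thread has announced a quiescent state after the element was retired.
        still_pending = []
        for retire_epoch, element in self.limbo:
            if retire_epoch + 2 <= self.global_epoch:
                self.freed.append(element)
            else:
                still_pending.append((retire_epoch, element))
        self.limbo = still_pending
```

With two simulated threads, an element retired in epoch 0 stays in `limbo` until both threads have called `quiescent_state` twice, at which point it moves to `freed`. The sketch also hints at why the execution environment matters: if one thread never announces a quiescent state, nothing is ever reclaimed.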