Analysis of the Memory Registration Process in the Mellanox InfiniBand Software Stack

  • Authors:
  • Frank Mietke;Robert Rex;Robert Baumgartl;Torsten Mehlan;Torsten Hoefler;Wolfgang Rehm

  • Affiliations:
  • Department of Computer Science, Chemnitz University of Technology, Germany (all authors)

  • Venue:
  • Euro-Par'06 Proceedings of the 12th international conference on Parallel Processing
  • Year:
  • 2006

Abstract

To leverage high-speed interconnects such as InfiniBand, it is important to minimize communication overhead. The most disruptive overhead is the registration of communication memory. In this paper, we present our analysis of the memory registration process inside the Mellanox InfiniBand driver and discuss possible ways around this bottleneck. We evaluate and characterize the most time-consuming parts of the execution path of the memory registration function using the Read Time Stamp Counter (RDTSC) instruction. We present measurements on AMD Opteron and Intel Xeon systems with different types of Host Channel Adapters for PCI-X and PCI-Express. Finally, we conclude with first results using Linux hugepage support to shorten the time needed to register a memory region.
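
As a rough illustration of why hugepages might shorten registration, the sketch below counts how many pages a buffer spans for a given page size. It rests on assumptions not stated in the abstract: that per-page work contributes noticeably to registration time, and that the page sizes are 4 KiB (base) and 2 MiB (huge), which are typical on x86-64 Linux. The helper name `pages_spanned` is hypothetical and is not part of any InfiniBand API.

```c
#include <stdint.h>

/*
 * Hypothetical helper: number of pages of size page_size that the byte
 * range [addr, addr + len) touches. page_size must be a power of two.
 * An unaligned buffer can touch one page more than len / page_size.
 */
static uint64_t pages_spanned(uint64_t addr, uint64_t len, uint64_t page_size)
{
    if (len == 0)
        return 0;

    /* Round the first and last byte addresses down to their page start. */
    uint64_t first_page = addr & ~(page_size - 1);
    uint64_t last_page  = (addr + len - 1) & ~(page_size - 1);

    return (last_page - first_page) / page_size + 1;
}
```

Under these assumptions, a page-aligned 64 MiB buffer spans 16384 base pages but only 32 hugepages, so any cost that scales with the page count would shrink by a factor of 512. How much of the measured registration time actually scales this way is what the paper's analysis addresses; the sketch does not answer it.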