Automatic NUMA characterization using Cbench

  • Authors:
  • Ryan K. Braithwaite; Wu-chun Feng; Patrick S. McCormick

  • Affiliations:
  • Los Alamos National Laboratory, Los Alamos, NM, USA; Virginia Tech, Blacksburg, VA, USA; Los Alamos National Laboratory, Los Alamos, NM, USA

  • Venue:
  • ICPE '12 Proceedings of the 3rd ACM/SPEC International Conference on Performance Engineering
  • Year:
  • 2012

Abstract

Clusters of seemingly homogeneous compute nodes are increasingly heterogeneous within each node due to replication and distribution of node-level subsystems. This intra-node heterogeneity can degrade program execution performance by imposing additional costs on accesses to non-local data. In this work-in-progress paper, we present extensions to the Cbench Scalable Testing Framework for analyzing main memory and PCIe data-access performance in modern NUMA architectures. The information provided by this tool is intended to be of use for task scheduling, performance modeling, and evaluation of NUMA systems.
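
To make the notion of a non-local data-access cost concrete, the following is a minimal sketch of how a per-node bandwidth characterization might be reduced to local-versus-remote penalty ratios that a scheduler or performance model could consume. It assumes the measurements can be arranged as a square matrix indexed by CPU node and memory node. The function name, the matrix layout, and the numbers are hypothetical placeholders; they are not Cbench output and are not results from the paper.

```python
def remote_penalties(bandwidth):
    """Map each non-local (cpu_node, mem_node) pair to local_bw / remote_bw.

    bandwidth[i][j] is the bandwidth observed when code running on NUMA
    node i accesses memory resident on node j (hypothetical layout).
    A ratio above 1.0 means the remote access is slower than the local one.
    """
    penalties = {}
    for cpu_node, row in enumerate(bandwidth):
        local = row[cpu_node]  # diagonal entry: node accessing its own memory
        for mem_node, bw in enumerate(row):
            if mem_node != cpu_node:
                penalties[(cpu_node, mem_node)] = local / bw
    return penalties


# Placeholder values in GB/s, invented for illustration only.
example = [
    [10.0, 5.0],
    [4.0, 8.0],
]
penalties = remote_penalties(example)
```

A ratio per node pair, rather than a single machine-wide figure, keeps asymmetries visible: the cost from node 0 to node 1 need not match the cost in the reverse direction.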