WBTK: a New Set of Microbenchmarks to Explore Memory System Performance for Scientific Computing

  • Authors:
  • W. Jalby;C. Lemuet;X. Le Pasteur

  • Affiliations:
  • PRISM LABORATORY, UNIVERSITY OF VERSAILLES, FRANCE;PRISM LABORATORY, UNIVERSITY OF VERSAILLES, FRANCE;PRISM LABORATORY, UNIVERSITY OF VERSAILLES, FRANCE

  • Venue:
  • International Journal of High Performance Computing Applications
  • Year:
  • 2004

Abstract

Memory hierarchies are a key component in obtaining high performance on modern microprocessors. To satisfy the ever-increasing demand for higher data access rates, they are also becoming increasingly complex: multilevel caches, non-blocking caches, sophisticated instructions for prefetch and cache control, etc. Although these advanced features promise large performance gains, in some cases they also generate performance "anomalies" (i.e., poor performance triggered by specific code patterns). To locate and understand these anomalies precisely, a new set of microbenchmarks called WBTK is introduced. Systematic experimentation on Alpha 21264, Power4 and Itanium1 shows, first, that these microbenchmarks detect most of the anomalies encountered on simple BLAS1-type codes and, second, that vectorization of memory accesses is an efficient workaround for most of these anomalies.
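
To make the idea of a BLAS1-type microbenchmark concrete, the sketch below times a DAXPY-style loop (y = a*x + y) over a range of vector lengths and reports the cost per element. It is not WBTK and does not reproduce the paper's method: the function names `daxpy`, `time_daxpy` and `sweep`, the vector sizes, and the repeat count are all illustrative assumptions. In pure Python the interpreter overhead dominates the timing, so the sketch shows only the structure of such a sweep; an actual memory-system measurement would presumably use compiled or hand-written kernels.

```python
import time
from array import array


def daxpy(a, x, y):
    """BLAS1-style kernel (illustrative): y[i] = a * x[i] + y[i]."""
    for i in range(len(x)):
        y[i] = a * x[i] + y[i]


def time_daxpy(n, repeats=3):
    """Return the best observed time, in ns per element, for length-n vectors."""
    x = array("d", [1.0]) * n
    best = float("inf")
    for _ in range(repeats):
        y = array("d", [2.0]) * n  # fresh output vector for each run
        start = time.perf_counter()
        daxpy(0.5, x, y)
        elapsed = time.perf_counter() - start
        best = min(best, elapsed)
    return best * 1e9 / n


def sweep(sizes):
    """Time the kernel for each vector length in sizes (hypothetical helper)."""
    return {n: time_daxpy(n) for n in sizes}


if __name__ == "__main__":
    # Vector lengths are arbitrary; each run touches two vectors of 8-byte doubles.
    for n, ns in sweep([2 ** k for k in range(10, 19, 2)]).items():
        print(f"n={n:>7}  working set={16 * n // 1024:>5} KiB  {ns:8.1f} ns/element")
```

In a compiled version of such a sweep, a jump in the per-element cost as the working set outgrows a cache level is the kind of signal one would look for; whether a given jump counts as an "anomaly" in the paper's sense depends on the specific code pattern and machine.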