Performance analysis using the MIPS R10000 performance counters

  • Authors:
  • Marco Zagha;Brond Larson;Steve Turner;Marty Itzkowitz

  • Affiliations:
  • Silicon Graphics Inc., 2011 N. Shoreline Blvd., Mountain View, CA;Silicon Graphics Inc., 2011 N. Shoreline Blvd., Mountain View, CA;Silicon Graphics Inc., 2011 N. Shoreline Blvd., Mountain View, CA;Silicon Graphics Inc., 2011 N. Shoreline Blvd., Mountain View, CA

  • Venue:
  • Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
  • Year:
  • 1996

Quantified Score

Hi-index 0.01

Visualization

Abstract

Tuning supercomputer application performance often requires analyzing the interaction of the application and the underlying architecture. In this paper, we describe support in the MIPS R10000 for non-intrusively monitoring a variety of processor events -- support that is particularly useful for characterizing the dynamic behavior of multi-level memory hierarchies, hardware-based cache coherence, and speculative execution. We first explain how performance data is collected using an integrated set of hardware mechanisms, operating system abstractions, and performance tools. We then describe several examples drawn from scientific applications that illustrate how the counters and profiling tools provide information that helps developers analyze and tune applications.