Numprof: a performance analysis framework for numerical libraries

  • Authors:
  • Olli-Pekka Lehto

  • Affiliations:
  • CSC - IT Center for Science Ltd., Espoo, Finland

  • Venue:
  • PARA'12 Proceedings of the 11th international conference on Applied Parallel and Scientific Computing
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper introduces Numprof, a profiling framework for performance analysis of numerical libraries. The framework consists of a profiler and replayer for the BLAS and FFTW3 libraries. The profiler records library call events with a user configurable amount of detail. The replayer can be used to execute library calls based on the profiling trace files generated by the profiler. We explore real-world use cases for the framework and demonstrate that due to its low overhead it is feasible to be used for continuous statistical analysis of numerical library calls.