TAUmon: scalable online performance data analysis in TAU

  • Authors:
  • Chee Wai Lee;Allen D. Malony;Alan Morris

  • Affiliations:
  • Department Computer and Information Science, University Oregon, Eugene, Oregon;Department Computer and Information Science, University Oregon, Eugene, Oregon;Department Computer and Information Science, University Oregon, Eugene, Oregon

  • Venue:
  • Euro-Par 2010 Proceedings of the 2010 conference on Parallel processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper, we present an update on the scalable online support for performance data analysis and monitoring in TAU. Extending on our prior work with TAUoverSupermon and TAUoverMRNet, we show how online analysis operations can also be supported directly and scalably using the parallel infrastructure provided by an MPI application instrumented with TAU. We also report on efforts to streamline and update TAUoverMRNet. Together, these approaches form the basis for the investigation of online analysis capabilities in a TAU monitoring framework TAUmon. We discuss various analysis operations and capabilities enabled by online monitoring and how operations like event unification enable merged profiles to be produced with greatly reduced data volume prior to application shutdown. Scaling results with PFLOTRAN on the Cray XT5 and BG/P are presented along with a look at some initial performance information generated from FLASH through our TAUmon prototype frameworks.