FASE: A Framework for Scalable Performance Prediction of HPC Systems and Applications

  • Authors:
  • Eric Grobelny;David Bueno;Ian Troxel;Alan D. George;Jeffrey S. Vetter

  • Affiliations:
  • High-performance Computing and Simulation (HCS) Research Laboratory University of Florida Gainesville FL 32611, USA;High-performance Computing and Simulation (HCS) Research Laboratory University of Florida Gainesville FL 32611, USA;High-performance Computing and Simulation (HCS) Research Laboratory University of Florida Gainesville FL 32611, USA;High-performance Computing and Simulation (HCS) Research Laboratory University of Florida Gainesville FL 32611, USA;Future Technologies Group, Computer Science and Mathematics Division Oak Ridge National Laboratory Oak Ridge TN 37831, USA

  • Venue:
  • Simulation
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

As systems of computers become more compleX in terms of their architecture, interconnect and heterogeneity, the optimum configuration and utilization of these machines becomes a major challenge. To reduce the penalties caused by poorly configured systems, simulation is often used to predict the performance of key applications to be eXecuted on the new systems. Simulation provides the capability to observe component and system characteristics (e.g. performance and power) in order to make vital design decisions. However, simulating high-fidelity models can be very time consuming and even prohibitive when evaluating large-scale systems. The Fast and Accurate Simulation Environment (FASE) framework seeks to support large-scale system simulation by using high-fidelity models to capture the behavior of only the performance-critical components while employing abstraction techniques to capture the effects of those components with little impact on the system. In order to achieve this balance of accuracy and simulation speed, FASE provides a methodology and associated toolset to evaluate numerous architectural options. This approach allows users to make system design decisions based on quantifiable demands of their key applications rather than using manual analysis which can be error prone and impractical for large systems. The framework accomplishes this evaluation through a novel approach of combining discrete-event simulation with an application characterization scheme in order to remove unnecessary details while focusing on components critical to the performance of the application. In this paper, we present the methodology and techniques behind FASE and include several case studies validating systems constructed using various applications and interconnects.