Decision trees and MPI collective algorithm selection problem

  • Authors:
  • Jelena Pješivac-Grbović;George Bosilca;Graham E. Fagg;Thara Angskun;Jack J. Dongarra

  • Affiliations:
  • Innovative Computing Laboratory, The University of Tennessee Computer Science Department, Knoxville, TN;Innovative Computing Laboratory, The University of Tennessee Computer Science Department, Knoxville, TN;Innovative Computing Laboratory, The University of Tennessee Computer Science Department, Knoxville, TN;Innovative Computing Laboratory, The University of Tennessee Computer Science Department, Knoxville, TN;Innovative Computing Laboratory, The University of Tennessee Computer Science Department, Knoxville, TN

  • Venue:
  • Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Selecting the close-to-optimal collective algorithm based on the parameters of the collective call at run time is an important step for achieving good performance of MPI applications. In this paper, we explore the applicability of C4.5 decision trees to the MPI collective algorithm selection problem. We construct C4.5 decision trees from the measured algorithm performance data and analyze both the decision tree properties and the expected run time performance penalty. In cases we considered, results show that the C4.5 decision trees can be used to generate a reasonably small and very accurate decision function. For example, the broadcast decision tree with only 21 leaves was able to achieve a mean performance penalty of 2.08%. Similarly, combining experimental data for reduce and broadcast and generating a decision function from the combined decision trees resulted in less than 2.5% relative performance penalty. The results indicate that C4.5 decision trees are applicable to this problem and should be more widely used in this domain.