PYTHIA-II: a knowledge/database system for managing performance data and recommending scientific software

  • Authors:
  • Elias N. Houstis;Ann C. Catlin;John R. Rice;Vassilios S. Verykios;Naren Ramakrishnan;Catherine E. Houstis

  • Affiliations:
  • Purdue Univ., West Lafayette, IN;Purdue Univ., West Lafayette, IN;Purdue Univ., West Lafayette, IN;Drexel Univ., Philadelphia, PA;Virginia Tech, Blacksburg, VA;Univ. of Crete, Heraklion, Greece

  • Venue:
  • ACM Transactions on Mathematical Software (TOMS) - Special issue in honor of John Rice's 65th birthday
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Often scientists need to locate appropriate software for their problems and then select from among many alternatives. We have previously proposed an approach for dealing with this task by processing performance data of the targeted software. This approach has been tested using a customized implementation referred to as PYTHIA. This experience made us realize the complexity of the algorithmic discovery of knowledge from performance data and of the management of these data together with the discovered knowledge. To address this issue, we created PYTHIA-II—a modular framework and system which combines a general knowledge discovery in databases (KDD) methodology and recommender system technologies to provide advice about scientific software/hardware artifacts. The functionality and effectiveness of the system is demonstrated for two existing performance studies using sets of software for solving partial differential equations. From the end-user perspective, PYTHIA-II allows users to specify the problem to be solved and their computational objectives. In turn, PYTHIA-II (i) selects the software available for the user's problem (ii) suggests parameter values, and (iii) assesses the recommendation provided. PYTHIA-II provides all the necessary facilities to set up database schemas for testing suites and associated performance data in order to test sets of software. Moreover, it allows easy interfacing of alternative data mining and recommendation facilities. PYTHIA-II is an open-ended system implemented on public domain software and has been used for performance evaluation in several different problem domains.