Data mining using unguided symbolic regression on a blast furnace dataset

  • Authors:
  • Michael Kommenda;Gabriel Kronberger;Christoph Feilmayr;Michael Affenzeller

  • Affiliations:
  • Heuristic and Evolutionary Algorithms Laboratory, School of Informatics, Communications and Media, Upper Austria University of Applied Sciences, Hagenberg, Austria;Heuristic and Evolutionary Algorithms Laboratory, School of Informatics, Communications and Media, Upper Austria University of Applied Sciences, Hagenberg, Austria;Voestalpine Stahl GmbH, Linz, Austria;Heuristic and Evolutionary Algorithms Laboratory, School of Informatics, Communications and Media, Upper Austria University of Applied Sciences, Hagenberg, Austria

  • Venue:
  • EvoApplications'11 Proceedings of the 2011 international conference on Applications of evolutionary computation - Volume Part I
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper a data mining approach for variable selection and knowledge extraction from datasets is presented. The approach is based on unguided symbolic regression (every variable present in the dataset is treated as the target variable in multiple regression runs) and a novel variable relevance metric for genetic programming. The relevance of each input variable is calculated and a model approximating the target variable is created. The genetic programming configurations with different target variables are executed multiple times to reduce stochastic effects and the aggregated results are displayed as a variable interaction network. This interaction network highlights important system components and implicit relations between the variables. The whole approach is tested on a blast furnace dataset, because of the complexity of the blast furnace and the many interrelations between the variables. Finally the achieved results are discussed with respect to existing knowledge about the blast furnace process.