Genetic programming: on the programming of computers by means of natural selection
Genetic programming: on the programming of computers by means of natural selection
An introduction to variable and feature selection
The Journal of Machine Learning Research
Genetic Programming and Evolvable Machines
Knowledge mining with genetic programming methods for variable selection in flavor design
Proceedings of the 12th annual conference on Genetic and evolutionary computation
Two fast tree-creation algorithms for genetic programming
IEEE Transactions on Evolutionary Computation
Hi-index | 0.00 |
In this paper a data mining approach for variable selection and knowledge extraction from datasets is presented. The approach is based on unguided symbolic regression (every variable present in the dataset is treated as the target variable in multiple regression runs) and a novel variable relevance metric for genetic programming. The relevance of each input variable is calculated and a model approximating the target variable is created. The genetic programming configurations with different target variables are executed multiple times to reduce stochastic effects and the aggregated results are displayed as a variable interaction network. This interaction network highlights important system components and implicit relations between the variables. The whole approach is tested on a blast furnace dataset, because of the complexity of the blast furnace and the many interrelations between the variables. Finally the achieved results are discussed with respect to existing knowledge about the blast furnace process.