Genetic programming: on the programming of computers by means of natural selection
Genetic programming: on the programming of computers by means of natural selection
Foundations of genetic programming
Foundations of genetic programming
Machine Learning
The Role of Occam‘s Razor in Knowledge Discovery
Data Mining and Knowledge Discovery
Fighting Bloat with Nonparametric Parsimony Pressure
PPSN VII Proceedings of the 7th International Conference on Parallel Problem Solving from Nature
Lexicographic Parsimony Pressure
GECCO '02 Proceedings of the Genetic and Evolutionary Computation Conference
Proceedings of the European Conference on Genetic Programming
Genetic programming for human oral bioavailability of drugs
Proceedings of the 8th annual conference on Genetic and evolutionary computation
Proceedings of the 9th annual conference on Genetic and evolutionary computation
The impact of population size on code growth in GP: analysis and empirical validation
Proceedings of the 10th annual conference on Genetic and evolutionary computation
Generalized Curvatures
Genetic Programming and Evolvable Machines
Operator equalisation, bloat and overfitting: a study on human oral bioavailability prediction
Proceedings of the 11th Annual conference on Genetic and evolutionary computation
Generality versus size in genetic programming
GECCO '96 Proceedings of the 1st annual conference on Genetic and evolutionary computation
IEEE Transactions on Evolutionary Computation
Improving symbolic regression with interval arithmetic and linear scaling
EuroGP'03 Proceedings of the 6th European conference on Genetic programming
A simple but theoretically-motivated method to control bloat in genetic programming
EuroGP'03 Proceedings of the 6th European conference on Genetic programming
On the limiting distribution of program sizes in tree-based genetic programming
EuroGP'07 Proceedings of the 10th European conference on Genetic programming
Operator equalisation and bloat free GP
EuroGP'08 Proceedings of the 11th European conference on Genetic programming
Crossover, sampling, bloat and the harmful effects of size limits
EuroGP'08 Proceedings of the 11th European conference on Genetic programming
A Field Guide to Genetic Programming
A Field Guide to Genetic Programming
Proceedings of the 12th annual conference companion on Genetic and evolutionary computation
Open issues in genetic programming
Genetic Programming and Evolvable Machines
Variance based selection to improve test set performance in genetic programming
Proceedings of the 13th annual conference on Genetic and evolutionary computation
Predicting problem difficulty for genetic programming applied to data classification
Proceedings of the 13th annual conference on Genetic and evolutionary computation
Overfitting detection and adaptive covariant parsimony pressure for symbolic regression
Proceedings of the 13th annual conference companion on Genetic and evolutionary computation
A quantitative study of learning and generalization in genetic programming
EuroGP'11 Proceedings of the 14th European conference on Genetic programming
An empirical study of functional complexity as an indicator of overfitting in genetic programming
EuroGP'11 Proceedings of the 14th European conference on Genetic programming
Estimating classifier performance with genetic programming
EuroGP'11 Proceedings of the 14th European conference on Genetic programming
Validation sets for evolutionary curtailment with improved generalisation
ICHIT'11 Proceedings of the 5th international conference on Convergence and hybrid information technology
Random sampling technique for overfitting control in genetic programming
EuroGP'12 Proceedings of the 15th European conference on Genetic Programming
Operator equalisation for bloat free genetic programming and a survey of bloat control methods
Genetic Programming and Evolvable Machines
A new methodology for the GP theory toolbox
Proceedings of the 14th annual conference on Genetic and evolutionary computation
Where should we stop? an investigation on early stopping for GP learning
SEAL'12 Proceedings of the 9th international conference on Simulated Evolution and Learning
Bootstrapping to reduce bloat and improve generalisation in genetic programming
Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
A bootstrapping approach to reduce over-fitting in genetic programming
Proceedings of the 15th annual conference companion on Genetic and evolutionary computation
Hi-index | 0.00 |
Recent contributions clearly show that eliminating bloat in a genetic programming system does not necessarily eliminate overfitting and vice-versa. This fact seems to contradict a common agreement of many researchers known as the minimum description length principle, which states that the best model is the one that minimizes the amount of information needed to encode it. Another common agreement is that overfitting should be, in some sense, related to the functional complexity of the model. The goal of this paper is to define three measures to respectively quantify bloat, overfitting and functional complexity of solutions and show their suitability on a set of test problems including a simple bidimensional symbolic regression test function and two real-life multidimensional regression problems. The experimental results are encouraging and should pave the way to further investigation. Advantages and drawbacks of the proposed measures are discussed, and ways to improve them are suggested. In the future, these measures should be useful to study and better understand the relationship between bloat, overfitting and functional complexity of solutions.