Evolutionary model trees for handling continuous classes in machine learning

  • Authors:
  • Rodrigo C. Barros;Duncan D. Ruiz;Márcio P. Basgalupp

  • Affiliations:
  • University of São Paulo, Av. Trabalhador São Carlense 400, 13560-970 São Carlos, SP, Brazil;Pontifical Catholic University of Rio Grande do Sul, Av. Ipiranga 6681, 90619-900 Porto Alegre, RS, Brazil;Instituto de Ciência e Tecnologia, Universidade Federal de São Paulo, Rua Talim 330, 12231-280 São José dos Campos, SP, Brazil

  • Venue:
  • Information Sciences: an International Journal
  • Year:
  • 2011

Quantified Score

Hi-index 0.08

Visualization

Abstract

Model trees are a particular case of decision trees employed to solve regression problems. They have the advantage of presenting an interpretable output, helping the end-user to get more confidence in the prediction and providing the basis for the end-user to have new insight about the data, confirming or rejecting hypotheses previously formed. Moreover, model trees present an acceptable level of predictive performance in comparison to most techniques used for solving regression problems. Since generating the optimal model tree is an NP-Complete problem, traditional model tree induction algorithms make use of a greedy top-down divide-and-conquer strategy, which may not converge to the global optimal solution. In this paper, we propose a novel algorithm based on the use of the evolutionary algorithms paradigm as an alternate heuristic to generate model trees in order to improve the convergence to globally near-optimal solutions. We call our new approach evolutionary model tree induction (E-Motion). We test its predictive performance using public UCI data sets, and we compare the results to traditional greedy regression/model trees induction algorithms, as well as to other evolutionary approaches. Results show that our method presents a good trade-off between predictive performance and model comprehensibility, which may be crucial in many machine learning applications.