Inducing diverse decision forests with genetic programming

  • Authors:
  • Jan Suchý; Jiří Kubalík

  • Affiliations:
  • Department of Cybernetics, CTU Prague, Praha 2, Czech Republic; Department of Cybernetics, CTU Prague, Praha 2, Czech Republic

  • Venue:
  • EuroGP'05 Proceedings of the 8th European conference on Genetic Programming
  • Year:
  • 2005

Abstract

This paper presents an algorithm for inducing ensembles of decision trees, also referred to as decision forests. To achieve high expressiveness, the induced trees are multivariate, with various, possibly user-defined, tests in their internal nodes. Strongly typed genetic programming is used to evolve the structure of the tests. Special attention is given to the diversity of the constructed forest. An approach is proposed that explicitly encourages the induction algorithm to produce a different tree in each run, so that each tree represents an alternative description of the data. Forests constructed this way are shown to have significantly lower classification error than other ensemble methods, even for small forest sizes. Classification accuracy is also compared with other recent methods on several real-world datasets.
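
The abstract does not say how diversity is measured or encouraged, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes each tree is represented by its list of predicted labels on a shared sample set. The function names, the disagreement measure, and the `weight` parameter are all hypothetical choices made for illustration.

```python
from collections import Counter
from itertools import combinations


def majority_vote(predictions):
    """Combine per-tree label lists into one label per sample by plurality."""
    combined = []
    for sample_votes in zip(*predictions):
        combined.append(Counter(sample_votes).most_common(1)[0][0])
    return combined


def disagreement(preds_a, preds_b):
    """Fraction of samples on which two classifiers predict different labels."""
    differing = sum(a != b for a, b in zip(preds_a, preds_b))
    return differing / len(preds_a)


def mean_pairwise_disagreement(predictions):
    """Average disagreement over all pairs of trees (one simple diversity measure)."""
    pairs = list(combinations(predictions, 2))
    return sum(disagreement(a, b) for a, b in pairs) / len(pairs)


def penalized_fitness(candidate, forest, labels, weight=0.5):
    """Hypothetical fitness: accuracy plus a bonus for differing from existing trees.

    This is an assumed scheme for illustration, not the paper's fitness function.
    """
    accuracy = sum(p == y for p, y in zip(candidate, labels)) / len(labels)
    if not forest:
        return accuracy
    novelty = sum(disagreement(candidate, tree) for tree in forest) / len(forest)
    return accuracy + weight * novelty


# Toy predictions from three hypothetical trees on four samples.
labels = [1, 0, 1, 1]
tree_a = [1, 0, 0, 1]
tree_b = [1, 1, 1, 1]
tree_c = [0, 0, 1, 1]

forest_vote = majority_vote([tree_a, tree_b, tree_c])
diversity = mean_pairwise_disagreement([tree_a, tree_b, tree_c])
score_c = penalized_fitness(tree_c, [tree_a], labels)
```

In this toy case each tree is wrong on a different sample, so the plurality vote recovers all four labels even though no single tree does, which is the intuition behind favouring trees that describe the data differently.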