Simplification methods for model trees with regression and splitting nodes

  • Authors:
  • Michelangelo Ceci;Annalisa Apice;Donato Malerba

  • Affiliations:
  • Dipartimento di Informatica, Università degli Studi, Bari, Italy;Dipartimento di Informatica, Università degli Studi, Bari, Italy;Dipartimento di Informatica, Università degli Studi, Bari, Italy

  • Venue:
  • MLDM'03 Proceedings of the 3rd international conference on Machine learning and data mining in pattern recognition
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Model trees are tree-based regression models that associate leaves with linear regression models. A new method for the stepwise induction of model trees (SMOTI) has been developed. Its main characteristic is the construction of trees with two types of nodes: regression nodes, which perform only straight-line regression, and splitting nodes, which partition the feature space. In this way, internal regression nodes contribute to the definition of multiple linear models and have a "global" effect, while straight-line regressions at leaves have only "local" effects. In this paper the problem of simplifying model trees with both regression and splitting nodes is faced. In particular two methods, named Reduced Error Pruning (REP) and Reduced Error Grafting (REG), are proposed. They are characterized by the use of an independent pruning set. The effect of the simplification on model trees induced with SMOTI is empirically investigated. Results are in favour of simplified trees in most cases.