Wrapper discretization by means of estimation of distribution algorithms

  • Authors:
  • J. L. Flores;I. Inza;P. Larrañ/aga

  • Affiliations:
  • (Correspd. Tel.: +34 943018000/ Fax: +34 943015590/ E-mail: joseluis.flores@si.ehu.es) Intell. Sys. Grp., Dept. of Comp. Sci. and Artif. Intell., Univ. of The Basque Country, P.O. Box 649, 20080 D ...;Intelligent Systems Group, Dept. of Comp. Sci. and Artif. Intell., University of The Basque Country, P.O. Box 649, 20080 Donostia, San Sebastiá/n, Spain. E-mail: inza@si.ehu.es;Intelligent Systems Group, Department of Computer Science and Artificial Intelligence, University of The Basque Country, P.O. Box 649, 20080 Donostia, San Sebastiá/n, Spain. E-mail: ccplamup@s ...

  • Venue:
  • Intelligent Data Analysis
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a supervised wrapper approach to discretization. In contrast to many classical approaches, the discretization process is multivariate: all variables are discretized simultaneously, and the proposed discretization is evaluated with the Naive-Bayes classifier. The search for the optimal discretization is carried out as an optimization process with the learning model estimated accuracy guiding it. The global optimization algorithm is based on estimation of distribution algorithms, a set of novel algorithms which are special kinds of evolutionary algorithms. In order to evaluate the behaviour of the algorithm, an analysis of different parameters is performed by means of analysis of variance (ANOVA). The evaluation was carried out using artificial datasets, and with UCI datasets. The results suggest that the proposed method provides an effective and robust technique for discretizating variables.