Analyzing the impact of the discretization method when comparing Bayesian classifiers

  • Authors:
  • M. Julia Flores;José A. Gámez;Ana M. Martínez;José M. Puerta

  • Affiliations:
  • Computer Systems Department, Intelligent Systems & Data Mining, SIMD, University of Castilla-La Mancha, Albacete, Spain;Computer Systems Department, Intelligent Systems & Data Mining, SIMD, University of Castilla-La Mancha, Albacete, Spain;Computer Systems Department, Intelligent Systems & Data Mining, SIMD, University of Castilla-La Mancha, Albacete, Spain;Computer Systems Department, Intelligent Systems & Data Mining, SIMD, University of Castilla-La Mancha, Albacete, Spain

  • Venue:
  • IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part I
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Most of the methods designed within the framework of Bayesian networks (BNs) assume that the involved variables are of discrete nature, but this assumption rarely holds in real problems. The Bayesian classifier AODE (Aggregating One-Dependence Estimators) e.g. can only work directly with discrete variables. The HAODE (from Hybrid AODE) classifier is proposed as an appealing alternative to AODE which is less affected by the discretization process. In this paper, we study if this behavior holds when applying different discretization methods. More importantly, we include other Bayesian classifiers in the comparison to find out to what extent the type of discretization affects their results in terms of accuracy and bias-variance discretization. If the type of discretization applied is not decisive, then future experiments can be k times faster, k being the number of discretization methods considered.