A Top-Down and Greedy Method for Discretization of Continuous Attributes

  • Authors:
  • Chien-I Lee;Cheng-Jung Tsai;Ya-Ru Yang;Wei-Pang Yang

  • Affiliations:
  • National University of Tainan;National Chiao Tung University;National University of Tainan;National Dong Hwa University,

  • Venue:
  • FSKD '07 Proceedings of the Fourth International Conference on Fuzzy Systems and Knowledge Discovery - Volume 01
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Experiments show that CAIM discretization algorithm is superior to all the other top-down discretization algorithms. However, CAIM algorithm does not take the data distribution into account. The discretization formula used in CAIM also gives a high factor to the numbers of generated intervals. The two disadvantages make CAIM may generate irrational discrete results in some cases and further leads to the decrease of predictive accuracy of a classifier. In this paper we propose the Class-Attribute Contingency Coefficient discretization algorithm. The experimental results showed that compared with CAIM, our method can generate a better discretization scheme to bring on the improvement of accuracy of classification. With regard to the number of generated rules and execution time of a classifier, CACC and CAIM achieve comparable results.