Learning locally weighted C4.4 for class probability estimation

  • Authors:
  • Liangxiao Jiang;Harry Zhang;Dianhong Wang;Zhihua Cai

  • Affiliations:
  • Faculty of Computer Science, China University of Geosciences, Wuhan, Hubei, P.R. China;Faculty of Computer Science, University of New Brunswick, Fredericton, NB, Canada;Faculty of Computer Science, China University of Geosciences, Wuhan, Hubei, P.R. China;Faculty of Computer Science, China University of Geosciences, Wuhan, Hubei, P.R. China

  • Venue:
  • DS'07 Proceedings of the 10th international conference on Discovery science
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In many real-world data mining applications, accurate class probability estimations are often required to make optimal decisions. For example, in direct marketing, we often need to deploy different promotion strategies to customers with different likelihood (probability) of buying some products. When our learning task is to build a model with accurate class probability estimations, C4.4 is the most popular one for achieving this task because of its efficiency and effect. In this paper, we present a locally weighted version of C4.4 to scale up its class probability estimation performance by combining locally weighted learning with C4.4. We call our improved algorithm locally weighted C4.4, simply LWC4.4. We experimentally tested LWC4.4 using the whole 36 UCI data sets selected by Weka, and compared it to other related algorithms: C4.4, NB, KNN, NBTree, and LWNB. The experimental results show that LWC4.4 significantly outperforms all the other algorithms in term of conditional log likelihood, simply CLL. Thus, our work provides an effective algorithm to produce accurate class probability estimation.