CBC: An associative classifier with a small number of rules

  • Authors:
  • Houtao Deng;George Runger;Eugene Tuv;Wade Bannister

  • Affiliations:
  • Intuit, Mountain View, CA, USA;Arizona State University, Tempe, AZ, USA;Intel, Chandler, AZ, USA;Ingenix Consulting, Irvine, CA, USA

  • Venue:
  • Decision Support Systems
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

Associative classifiers have been proposed to achieve an accurate model with each individual rule being interpretable. However, existing associative classifiers often consist of a large number of rules and, thus, can be difficult to interpret. We show that associative classifiers consisting of an ordered rule set can be represented as a tree model. From this view, it is clear that these classifiers are restricted in that at least one child node of a non-leaf node is never split. We propose a new tree model, i.e., condition-based tree (CBT), to relax the restriction. Furthermore, we also propose an algorithm to transform a CBT to an ordered rule set with concise rule conditions. This ordered rule set is referred to as a condition-based classifier (CBC). Thus, the interpretability of an associative classifier is maintained, but more expressive models are possible. The rule transformation algorithm can be also applied to regular binary decision trees to extract an ordered set of rules with simple rule conditions. Feature selection is applied to a binary representation of conditions to simplify/improve the models further. Experimental studies show that CBC has competitive accuracy performance, and has a significantly smaller number of rules (median of 10 rules per data set) than well-known associative classifiers such as CBA (median of 47) and GARC (median of 21). CBC with feature selection has even a smaller number of rules.