Classification based on association rules: A lattice-based approach

  • Authors:
  • Loan T. T. Nguyen;Bay Vo;Tzung-Pei Hong;Hoang Chi Thanh

  • Affiliations:
  • Faculty of Information Technology, Broadcasting College II, Ho Chi Minh, Viet Nam;Information Technology College, Ho Chi Minh, Viet Nam;Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan, ROC and Department of Computer Science and Engineering, National Sun Yat-sen Univer ...;Department of Informatics, Ha Noi University of Science, Ha Noi, Viet Nam

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2012

Quantified Score

Hi-index 12.05

Visualization

Abstract

Classification plays an important role in decision support systems. A lot of methods for mining classification rules have been developed in recent years, such as C4.5 and ILA. These methods are, however, based on heuristics and greedy approaches to generate rule sets that are either too general or too overfitting for a given dataset. They thus often yield high error ratios. Recently, a new method for classification from data mining, called the Classification Based on Associations (CBA), has been proposed for mining class-association rules (CARs). This method has more advantages than the heuristic and greedy methods in that the former could easily remove noise, and the accuracy is thus higher. It can additionally generate a rule set that is more complete than C4.5 and ILA. One of the weaknesses of mining CARs is that it consumes more time than C4.5 and ILA because it has to check its generated rule with the set of the other rules. We thus propose an efficient pruning approach to build a classifier quickly. Firstly, we design a lattice structure and propose an algorithm for fast mining CARs using this lattice. Secondly, we develop some theorems and propose an algorithm for pruning redundant rules quickly based on these theorems. Experimental results also show that the proposed approach is more efficient than those used previously.