Efficient Rule-Based Attribute-Oriented Induction for Data Mining

  • Authors: David W. Cheung; H. Y. Hwang; Ada W. Fu; Jiawei Han
  • Affiliations: Department of Computer Science and Information Systems, The University of Hong Kong, Hong Kong (dcheung@csis.hku.hk); Department of Computer Science and Engineering, Chinese University of Hong Kong, Hong Kong; Department of Computer Science and Engineering, Chinese University of Hong Kong, Hong Kong (adafu@cse.cuhk.edu.hk); School of Computing Science, Simon Fraser University, Canada (han@cs.sfu.ca)
  • Venue: Journal of Intelligent Information Systems
  • Year: 2000

Abstract

Data mining has become an important technique with tremendous potential in many commercial and industrial applications. Attribute-oriented induction is a powerful mining technique and has been successfully implemented in the data mining system DBMiner (Han et al. Proc. 1996 Int'l Conf. on Data Mining and Knowledge Discovery (KDD'96), Portland, Oregon, 1996). However, its induction capability is limited by its reliance on unconditional concept generalization. In this paper, we extend concept generalization to rule-based concept hierarchies, which greatly enhances its induction power. When a previously proposed induction algorithm is applied to the more general rule-based case, an induction anomaly occurs that degrades its efficiency. We have developed an efficient algorithm for induction in the rule-based case that avoids the anomaly. Performance studies show that the algorithm is superior to a previously proposed algorithm based on backtracking.
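
The difference between unconditional and rule-based concept generalization can be illustrated with a minimal sketch. Everything in it is hypothetical and chosen only for illustration: the attributes `grade` and `status`, the concept names, the rules, and the helper functions are assumptions, not data or code from the paper, and the sketch does not reproduce the paper's algorithm.

```python
from collections import Counter

# Unconditional concept hierarchy (hypothetical): each value has exactly
# one parent concept, regardless of the rest of the tuple.
UNCONDITIONAL = {"A": "good", "B": "good", "C": "average", "D": "poor"}


def generalize_unconditional(record):
    """Climb one level using only the attribute's own value."""
    return {**record, "grade": UNCONDITIONAL[record["grade"]]}


def generalize_rule_based(record):
    """Climb one level; the parent concept also depends on `status`."""
    grade, status = record["grade"], record["status"]
    if grade == "A":
        parent = "good"
    elif grade == "B":
        # Hypothetical rule: a B is "good" only for undergraduates.
        parent = "good" if status == "undergraduate" else "average"
    elif grade == "C":
        # Hypothetical rule: a C is "average" only for undergraduates.
        parent = "average" if status == "undergraduate" else "poor"
    else:
        parent = "poor"
    return {**record, "grade": parent}


def induce(records, generalize, keep=("grade",)):
    """Generalize each tuple, project onto `keep`, merge duplicates with counts."""
    return Counter(tuple(generalize(r)[a] for a in keep) for r in records)


students = [
    {"grade": "A", "status": "graduate"},
    {"grade": "B", "status": "graduate"},
    {"grade": "B", "status": "undergraduate"},
    {"grade": "C", "status": "graduate"},
]

unconditional_result = induce(students, generalize_unconditional)
rule_based_result = induce(students, generalize_rule_based)
```

In this toy data the unconditional hierarchy merges both `B` students into the same concept, while the rule-based hierarchy separates them by `status`. The rule-based generalization needs the `status` value to be available when the rule is evaluated; the abstract does not define the induction anomaly in detail, so the sketch makes no attempt to model it.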