Rule synthesizing from multiple related databases

  • Authors:
  • Dan He;Xindong Wu;Xingquan Zhu

  • Affiliations:
  • Dept of Computer Sci., Univ of California Los Angeles, LA, CA;,Dept of Computer Sci., Univ of Vermont, Burlington, VT;,QCIS Center, Eng & IT, Univ of Technology, Sydney, NSW, Australia

  • Venue:
  • PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we study the problem of rule synthesizing from multiple related databases where items representing the databases may be different, and the databases may not be relevant, or similar to each other We argue that, for such multi-related databases, simple rule synthesizing without a detailed understanding of the databases is not able to reveal meaningful patterns inside the data collections Consequently, we propose a two-step clustering on the databases at both item and rule levels such that the databases in the final clusters contain both similar items and similar rules A weighted rule synthesizing method is then applied on each such cluster to generate final rules Experimental results demonstrate that the new rule synthesizing method is able to discover important rules which can not be synthesized by other methods.