Mining for combined association rules on multiple datasets

  • Authors:
  • Yanchang Zhao;Huaifeng Zhang;Fernando Figueiredo;Longbing Cao;Chengqi Zhang

  • Affiliations:
  • University of Technology, Sydney, Australia;University of Technology, Sydney, Australia;Centrelink, Australia;University of Technology, Sydney, Australia;University of Technology, Sydney, Australia

  • Venue:
  • Proceedings of the 2007 international workshop on Domain driven data mining
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many organisations have their digital information stored in a distributed systems structure scheme, be it in different locations, using vertically and horizontally distributed repositories, which brings about an high level of complexity to data mining. From a classical data mining view, where the algorithms expect a denormalised structure to be able to operate on, heterogeneous data sources, such as static demographic and dynamic transactional data are to be manipulated and integrated to the extent commercial association rules algorithms can be applied. Bearing in mind the usefulness and understandability of the application from a business perspective, combined rules of multiple patterns derived from different repositories, containing historical and point in time data, were used to produce new techniques in association mining applied to debt recovery. Initially debt repayment patterns were discovered using transactional data and class labels defined by domain expertise, then demographic patterns were attached to each of the class labels. After combining the patterns, two type of rules were discovered leading to different results: 1) same demographic pattern with different repayment patterns, and 2) same repayment pattern with different demographic patterns. The rules produced are interesting, valuable, complete and understandable, which shows the applicability and effectiveness of the new method.