Progressive weighted miner: an efficient method for time-constraint mining

  • Authors:
  • Chang-Hung Lee;Jian Chih Ou;Ming-Syan Chen

  • Affiliations:
  • BenQ Corporation, Taipei, Taiwan, R.O.C.;Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan, ROC;Department of Electrical Engineering, National Taiwan University, Taipei, Taiwan, ROC

  • Venue:
  • PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

The discovery of association relationship among the data in a huge database has been known to be useful in selective marketing, decision analysis, and business management. A significant amount of research effort has been elaborated upon the development of efficient algorithms for data mining. However, without fully considering the time-variant characteristics of items and transactions, it is noted that some discovered rules may be expired from users' interest. In other words, some discovered knowledge may be obsolete and of little use, especially when we perform the mining schemes on a transaction database of short life cycle products. This aspect is, however, rarely addressed in prior studies. To remedy this, we broaden in this paper the horizon of frequent pattern mining by introducing a weighted model of transaction-weighted association rules in a time-variant database. Specifically, we propose an efficient Progressive Weighted Miner (abbreviatedly as PWM) algorithm to perform the mining for this problem as well as conduct the corresponding performance studies. In algorithm PWM, the importance of each transaction period is first reflected by a proper weight assigned by the user. Then, PWM partitions the time-variant database in light of weighted periods of transactions and performs weighted mining. Algorithm PWM is designed to progressively accumulate the itemset counts based on the intrinsic partitioning characteristics and employ a filtering threshold in each partition to early prune out those cumulatively infrequent 2-itemsets. With this design, algorithm PWM is able to efficiently produce weighted association rules for applications where different time periods are assigned with different weights and lead to results of more interest.