Isolated items discarding strategy for discovering high utility itemsets

  • Authors:
  • Yu-Chiang Li;Jieh-Shan Yeh;Chin-Chen Chang

  • Affiliations:
  • Department of Computer Science and Information Engineering, Southern Taiwan University, No. 1, Nantai St., Yung-Kang City, Tainan 710, Taiwan, ROC;Department of Computer Science and Information Management, Providence University, 200 Chung-Chi, Shalu, Taichung 433, Taiwan, ROC;Department of Information Engineering and Computer Science, Feng Chia University, 100 Wenhwa, Seatwen, Taichung 40724, Taiwan, ROC and Department of Computer Science and Information Engineering, N ...

  • Venue:
  • Data & Knowledge Engineering
  • Year:
  • 2008

Quantified Score

Hi-index 0.01

Visualization

Abstract

Traditional methods of association rule mining consider the appearance of an item in a transaction, whether or not it is purchased, as a binary variable. However, customers may purchase more than one of the same item, and the unit cost may vary among items. Utility mining, a generalized form of the share mining model, attempts to overcome this problem. Since the Apriori pruning strategy cannot identify high utility itemsets, developing an efficient algorithm is crucial for utility mining. This study proposes the Isolated Items Discarding Strategy (IIDS), which can be applied to any existing level-wise utility mining method to reduce candidates and to improve performance. The most efficient known models for share mining are ShFSM and DCG, which also work adequately for utility mining as well. By applying IIDS to ShFSM and DCG, the two methods FUM and DCG+ were implemented, respectively. For both synthetic and real datasets, experimental results reveal that the performance of FUM and DCG+ is more efficient than that of ShFSM and DCG, respectively. Therefore, IIDS is an effective strategy for utility mining.