Single-pass incremental and interactive mining for weighted frequent patterns

  • Authors:
  • Chowdhury Farhan Ahmed;Syed Khairuzzaman Tanbeer;Byeong-Soo Jeong;Young-Koo Lee;Ho-Jin Choi

  • Affiliations:
  • Department of Computer Engineering, Kyung Hee University, 1 Seochun-dong, Kihung-gu, Youngin-si, Kyunggi-do, 446-701, Republic of Korea;Department of Computer Engineering, Kyung Hee University, 1 Seochun-dong, Kihung-gu, Youngin-si, Kyunggi-do, 446-701, Republic of Korea;Department of Computer Engineering, Kyung Hee University, 1 Seochun-dong, Kihung-gu, Youngin-si, Kyunggi-do, 446-701, Republic of Korea;Department of Computer Engineering, Kyung Hee University, 1 Seochun-dong, Kihung-gu, Youngin-si, Kyunggi-do, 446-701, Republic of Korea;Department of Computer Science, Korea Advanced Institute of Science and Technology (KAIST), 335 Gwahak-ro, Yuseong-gu, Daejeon 305-701, Republic of Korea

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2012

Quantified Score

Hi-index 12.06

Visualization

Abstract

Weighted frequent pattern (WFP) mining is more practical than frequent pattern mining because it can consider different semantic significance (weight) of the items. For this reason, WFP mining becomes an important research issue in data mining and knowledge discovery. However, existing algorithms cannot be applied for incremental and interactive WFP mining and also for stream data mining because they are based on a static database and require multiple database scans. In this paper, we present two novel tree structures IWFPT"W"A (Incremental WFP tree based on weight ascending order) and IWFPT"F"D (Incremental WFP tree based on frequency descending order), and two new algorithms IWFP"W"A and IWFP"F"D for incremental and interactive WFP mining using a single database scan. They are effective for incremental and interactive mining to utilize the current tree structure and to use the previous mining results when a database is updated or a minimum support threshold is changed. IWFP"W"A gets advantage in candidate pattern generation by obtaining the highest weighted item in the bottom of IWFPT"W"A. IWFP"F"D ensures that any non-candidate item cannot appear before candidate items in any branch of IWFPT"F"D and thus speeds up the prefix tree and conditional tree creation time during mining operation. IWFPT"F"D also achieves the highly compact incremental tree to save memory space. To our knowledge, this is the first research work to perform single-pass incremental and interactive mining for weighted frequent patterns. Extensive performance analyses show that our tree structures and algorithms are very efficient and scalable for single-pass incremental and interactive WFP mining.