Transaction-item association matrix-based frequent pattern network mining algorithm in large-scale transaction database

  • Authors:
  • Wei-Qing Sun;Cheng-Min Wang;Tie-Yan Zhang;Yan Zhang

  • Affiliations:
  • Department of Electronic, Information & Electrical Engineering, Shanghai Jiaotong University, Shanghai, China;Department of Electronic, Information & Electrical Engineering, Shanghai Jiaotong University, Shanghai, China;Shenyang Institute of Engineering, Shenyang, Liaoning, China;Department of Electronic, Information & Electrical Engineering, Shanghai Jiaotong University, Shanghai, China

  • Venue:
  • WSEAS Transactions on Computers
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

To increase the efficiency of data mining is the emphasis in this field at present. Through the establishment of transaction-item association matrix, this paper changes the process of association rule mining to elementary matrix operation, which makes the process of data mining clear and simple. Compared with algorithms like Apriori, this method avoids the demerit of traversing the database repetitiously, and increases the efficiency of association rule mining obviously in the use of sparse storage technique for large-scale matrix. To incremental type of transaction matrix, it can also make the maintainment of association rule more convenient in the use of partitioning calculation technique of matrix. On the other and, aiming at the demerits in FP-growth algorithm, this paper proposes a FP-network model which compresses the data needed in association rule mining in a FP-network. Compared with the primary FP-tree model, the FP-network proposed is undirected, which enlarge the scale of transaction storage; furthermore, the FP-network is stored through the definition of transaction-item association matrix, it is convenient to make association rule mining on the basic of defining node capability. Experiment results show that the FP-network mining association rule algorithm proposed by this paper not only inherits the merits of FP-growth algorithm, but also maintains and updates data conveniently. It improves the efficiency of association rule mining significantly.