Mining frequent itemsets in large databases: The hierarchical partitioning approach

Authors:
Fan-Chen Tseng
Affiliations:
Department of Multimedia and M-Commerce, Kainan University, No. 1, Kainan Road, Luzhu, Taoyuan County 33857, Taiwan, ROC
Venue:
Expert Systems with Applications: An International Journal
Year:
2013

Citing 16
Cited 0

Mining frequent patterns without candidate generation

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Data Mining: An Overview from a Database Perspective

IEEE Transactions on Knowledge and Data Engineering
Discovering Frequent Closed Itemsets for Association Rules

ICDT '99 Proceedings of the 7th International Conference on Database Theory
H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases

ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Mining Frequent Closed Itemsets with the Frequent Pattern List

ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
An Efficient Algorithm for Mining Association Rules in Large Databases

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Generating Frequent Patterns with the Frequent Pattern List

PAKDD '01 Proceedings of the 5th Pacific-Asia Conference on Knowledge Discovery and Data Mining
Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach

Data Mining and Knowledge Discovery
CLOSET+: searching for the best strategies for mining frequent closed itemsets

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Efficient Algorithms for Mining Closed Itemsets and Their Lattice Structure

IEEE Transactions on Knowledge and Data Engineering
Data Mining: Concepts and Techniques

Data Mining: Concepts and Techniques
Segmenting online gamers by motivation

Expert Systems with Applications: An International Journal
Data Mining: Concepts and Techniques

Data Mining: Concepts and Techniques
DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets

Expert Systems with Applications: An International Journal
An adaptive approach to mining frequent itemsets efficiently

Expert Systems with Applications: An International Journal
To Bid or to Buy?: Online Shoppers' Preferences for Online Purchasing Channels

International Journal of E-Business Research

Quantified Score

Hi-index	12.05

Visualization

Abstract

Although many methods have been proposed to enhance the efficiencies of data mining, little research has been devoted to the issue of scalability - that is, the problem of mining frequent itemsets when the size of the database is very large. This study proposes a methodology, hierarchical partitioning, for mining frequent itemsets in large databases, based on a novel data structure called the Frequent Pattern List (FPL). One of the major features of the FPL is its ability to partition the database, and thus transform the database into a set of sub-databases of manageable sizes. As a result, a divide-and-conquer approach can be developed to perform the desired data-mining tasks. Experimental results show that hierarchical partitioning is capable of mining frequent itemsets and frequent closed itemsets in very large databases.