Index-BitTableFI: An improved algorithm for mining frequent itemsets

Authors:
Wei Song;Bingru Yang;Zhangyan Xu
Affiliations:
College of Information Engineering, North China University of Technology, Beijing 100144, China;School of Information Engineering, University of Science and Technology, Beijing, Beijing 100083, China;Department of Computer, Guanxi Normal University, Guilin 541004, China
Venue:
Knowledge-Based Systems
Year:
2008

Citing 20
Cited 9

Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
An effective hash-based algorithm for mining association rules

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Dynamic itemset counting and implication rules for market basket data

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Integrating association rule mining with relational database systems: alternatives and implications

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Mining frequent patterns without candidate generation

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Discovery of Frequent Episodes in Event Sequences

Data Mining and Knowledge Discovery
Complete Mining of Frequent Patterns from Graphs: Mining Graph Data

Machine Learning
Scalable Algorithms for Association Mining

IEEE Transactions on Knowledge and Data Engineering
Mining Sequential Patterns

ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases

ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
An Efficient Algorithm for Mining Association Rules in Large Databases

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Sampling Large Databases for Association Rules

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
Fast vertical mining using diffsets

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Fast Algorithms for Frequent Itemset Mining Using FP-Trees

IEEE Transactions on Knowledge and Data Engineering
Mining Approximate Frequent Itemsets from Noisy Data

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Mining lossless closed frequent patterns with weight constraints

Knowledge-Based Systems
Non-derivable itemset mining

Data Mining and Knowledge Discovery
BitTableFI: An efficient mining frequent itemsets algorithm

Knowledge-Based Systems
CBAR: an efficient method for mining association rules

Knowledge-Based Systems

Advanced Matrix Algorithm (AMA): reducing number of scans for association rule generation

International Journal of Business Intelligence and Data Mining
An improved association rules mining method

Expert Systems with Applications: An International Journal
Distributed BitTable multi-agent association rules mining algorithm

KES'11 Proceedings of the 15th international conference on Knowledge-based and intelligent information and engineering systems - Volume Part I
DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets

Expert Systems with Applications: An International Journal
Single-pass incremental and interactive mining for weighted frequent patterns

Expert Systems with Applications: An International Journal
Efficient colossal pattern mining in high dimensional datasets

Knowledge-Based Systems
Interrelation analysis of celestial spectra data using constrained frequent pattern trees

Knowledge-Based Systems
An efficient method for mining frequent itemsets with double constraints

Engineering Applications of Artificial Intelligence
Mining high utility itemsets by dynamically pruning the tree structure

Applied Intelligence

Quantified Score

Hi-index	0.01

Visualization

Abstract

Efficient algorithms for mining frequent itemsets are crucial for mining association rules as well as for many other data mining tasks. Methods for mining frequent itemsets have been implemented using a BitTable structure. BitTableFI is such a recently proposed efficient BitTable-based algorithm, which exploits BitTable both horizontally and vertically. Although making use of efficient bit wise operations, BitTableFI still may suffer from the high cost of candidate generation and test. To address this problem, a new algorithm Index-BitTableFI is proposed. Index-BitTableFI also uses BitTable horizontally and vertically. To make use of BitTable horizontally, index array and the corresponding computing method are proposed. By computing the subsume index, those itemsets that co-occurrence with representative item can be identified quickly by using breadth-first search at one time. Then, for the resulting itemsets generated through the index array, depth-first search strategy is used to generate all other frequent itemsets. Thus, the hybrid search is implemented, and the search space is reduced greatly. The advantages of the proposed methods are as follows. On the one hand, the redundant operations on intersection of tidsets and frequency-checking can be avoided greatly; On the other hand, it is proved that frequent itemsets, including representative item and having the same supports as representative item, can be identified directly by connecting the representative item with all the combinations of items in its subsume index. Thus, the cost for processing this kind of itemsets is lowered, and the efficiency is improved. Experimental results show that the proposed algorithm is efficient especially for dense datasets.