Mining frequent item sets by opportunistic projection

Authors:
Junqiang Liu;Yunhe Pan;Ke Wang;Jiawei Han
Affiliations:
Institute of Artificial Intelligence, Zhejiang University;Institute of Artificial Intelligence, Zhejiang University;Simon Fraser University;UIUC
Venue:
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Year:
2002

Citing 15
Cited 38

Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
An effective hash-based algorithm for mining association rules

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Dynamic itemset counting and implication rules for market basket data

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Efficiently mining long patterns from databases

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Mining frequent patterns without candidate generation

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Depth first generation of long patterns

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
A tree projection algorithm for generation of frequent item sets

Journal of Parallel and Distributed Computing - Special issue on high-performance data mining
Real world performance of association rule algorithms

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Pincer Search: A New Algorithm for Discovering the Maximum Frequent Set

EDBT '98 Proceedings of the 6th International Conference on Extending Database Technology: Advances in Database Technology
MAFIA: A Maximal Frequent Itemset Algorithm for Transactional Databases

Proceedings of the 17th International Conference on Data Engineering
H-Mine: Hyper-Structure Mining of Frequent Patterns in Large Databases

ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Discovery of Multiple-Level Association Rules from Large Databases

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
An Efficient Algorithm for Mining Association Rules in Large Databases

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Sampling Large Databases for Association Rules

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases

Database research at the University of Illinois at Urbana-Champaign

ACM SIGMOD Record
CT-ITL: efficient frequent item set mining using a compressed prefix tree with pattern growth

ADC '03 Proceedings of the 14th Australasian database conference - Volume 17
Inverted matrix: efficient discovery of frequent items in large datasets in the context of interactive mining

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
CLOSET+: searching for the best strategies for mining frequent closed itemsets

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
On computing, storing and querying frequent patterns

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Efficient dynamic mining of constrained frequent sets

ACM Transactions on Database Systems (TODS)
Statistical properties of transactional databases

Proceedings of the 2004 ACM symposium on Applied computing
Go Green: Recycle and Reuse Frequent Patterns

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Algorithms for mining association rules in bag databases

Information Sciences—Informatics and Computer Science: An International Journal
Tight upper bounds on the number of candidate patterns

ACM Transactions on Database Systems (TODS)
Fast and Memory Efficient Mining of Frequent Closed Itemsets

IEEE Transactions on Knowledge and Data Engineering
Association mining

ACM Computing Surveys (CSUR)
Market basket analysis in a multiple store environment

Decision Support Systems
An efficient approach to mining indirect associations

Journal of Intelligent Information Systems
Hiding Sensitive Association Rules with Limited Side Effects

IEEE Transactions on Knowledge and Data Engineering
A data mining approach for retail knowledge discovery with consideration of the effect of shelf-space adjacency on sales

Decision Support Systems
Efficient mining of weighted interesting patterns with a strong weight and/or support affinity

Information Sciences: an International Journal
A new approach to mine frequent patterns using item-transformation methods

Information Systems
Isolated items discarding strategy for discovering high utility itemsets

Data & Knowledge Engineering
Mining fault-tolerant frequent patterns efficiently with powerful pruning

Proceedings of the 2008 ACM symposium on Applied computing
A data mining proxy approach for efficient frequent itemset mining

The VLDB Journal — The International Journal on Very Large Data Bases
ON DATA STRUCTURES FOR ASSOCIATION RULE DISCOVERY

Applied Artificial Intelligence
An efficient mining of weighted frequent patterns with length decreasing support constraints

Knowledge-Based Systems
Efficient mining of interesting weighted patterns from directed graph traversals

Integrated Computer-Aided Engineering
On pushing weight constraints deeply into frequent itemset mining

Intelligent Data Analysis
Two-phase algorithms for a novel utility-frequent mining model

PAKDD'07 Proceedings of the 2007 international conference on Emerging technologies in knowledge discovery and data mining
Discovering itemset interactions

ACSC '09 Proceedings of the Thirty-Second Australasian Conference on Computer Science - Volume 91
Effective utility mining with the measure of average utility

Expert Systems with Applications: An International Journal
A new mining approach for uncertain databases using CUFP trees

Expert Systems with Applications: An International Journal
A fast algorithm for mining share-frequent itemsets

APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
Mining value-based item packages – an integer programming approach

Data Mining
A sampling-based method for mining frequent patterns from databases

FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
Direct candidates generation: a novel algorithm for discovering complete share-frequent itemsets

FSKD'05 Proceedings of the Second international conference on Fuzzy Systems and Knowledge Discovery - Volume Part II
WLPMiner: weighted frequent pattern mining with length-decreasing support constraints

PAKDD'05 Proceedings of the 9th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Finding closed itemsets in data streams

KES'05 Proceedings of the 9th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part II
Efficient colossal pattern mining in high dimensional datasets

Knowledge-Based Systems
Parallel frequent itemset mining using systolic arrays

Knowledge-Based Systems
Accelerating frequent item counting with FPGA

Proceedings of the 2014 ACM/SIGDA international symposium on Field-programmable gate arrays

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present a novel algorithm Opportune Project for mining complete set of frequent item sets by projecting databases to grow a frequent item set tree. Our algorithm is fundamentally different from those proposed in the past in that it opportunistically chooses between two different structures, array-based or tree-based, to represent projected transaction subsets, and heuristically decides to build unfiltered pseudo projection or to make a filtered copy according to features of the subsets. More importantly, we propose novel methods to build tree-based pseudo projections and array-based unfiltered projections for projected transaction subsets, which makes our algorithm both CPU time efficient and memory saving. Basically, the algorithm grows the frequent item set tree by depth first search, whereas breadth first search is used to build the upper portion of the tree if necessary. We test our algorithm versus several other algorithms on real world datasets, such as BMS-POS, and on IBM artificial datasets. The empirical results show that our algorithm is not only the most efficient on both sparse and dense databases at all levels of support threshold, but also highly scalable to very large databases.