Discovery of maximum length frequent itemsets

Authors:
Tianming Hu;Sam Yuan Sung;Hui Xiong;Qian Fu
Affiliations:
East China Normal University, DongGuan University of Technology, China;Department of Computer Science, South Texas College, USA;MSIS Department, Rutgers University, USA;Department of Computer Science, National University of Singapore, Singapore
Venue:
Information Sciences: an International Journal
Year:
2008

Citing 18
Cited 21

Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Efficiently mining long patterns from databases

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Mining frequent patterns without candidate generation

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Turbo-charging vertical mining of large databases

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Depth first generation of long patterns

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Algorithms for association rule mining — a general survey and comparison

ACM SIGKDD Explorations Newsletter
SPADE: an efficient algorithm for mining frequent sequences

Machine Learning
Mining Sequential Patterns: Generalizations and Performance Improvements

EDBT '96 Proceedings of the 5th International Conference on Extending Database Technology: Advances in Database Technology
Pincer Search: A New Algorithm for Discovering the Maximum Frequent Set

EDBT '98 Proceedings of the 6th International Conference on Extending Database Technology: Advances in Database Technology
MAFIA: A Maximal Frequent Itemset Algorithm for Transactional Databases

Proceedings of the 17th International Conference on Data Engineering
Efficiently Mining Maximal Frequent Itemsets

ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Dataset Filtering Techniques in Constraint-Based Frequent Pattern Mining

Proceedings of the ESF Exploratory Workshop on Pattern Detection and Discovery
Frequent term-based text clustering

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Mining Strong Affinity Association Patterns in Data Sets with Skewed Support Distribution

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Hyperclique pattern discovery

Data Mining and Knowledge Discovery
Looking into the seeds of time: Discovering temporal patterns in large transaction sets

Information Sciences: an International Journal
A false negative approach to mining frequent itemsets from high speed transactional data streams

Information Sciences: an International Journal

On discovery of soft associations with "most" fuzzy quantifier for item promotion applications

Information Sciences: an International Journal
Efficient single-pass frequent pattern mining using a prefix-tree

Information Sciences: an International Journal
Top-down mining of frequent closed patterns from very high dimensional data

Information Sciences: an International Journal
FIUT: A new method for mining frequent itemsets

Information Sciences: an International Journal
Mining frequent trajectory patterns in spatial-temporal databases

Information Sciences: an International Journal
Discovering Periodic-Frequent Patterns in Transactional Databases

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
An order-clique-based approach for mining maximal co-locations

Information Sciences: an International Journal
Sliding window-based frequent pattern mining over data streams

Information Sciences: an International Journal
An approach to discovering multi-temporal patterns and its application to financial databases

Information Sciences: an International Journal
Novel alarm correlation analysis system based on association rules mining in telecommunication networks

Information Sciences: an International Journal
Toward boosting distributed association rule mining by data de-clustering

Information Sciences: an International Journal
Finding top-k elements in data streams

Information Sciences: an International Journal
An improved association rules mining method

Expert Systems with Applications: An International Journal
Discovering discriminative test items for achievement tests

Expert Systems with Applications: An International Journal
A case retrieval approach using similarity and association knowledge

OTM'11 Proceedings of the 2011th Confederated international conference on On the move to meaningful internet systems - Volume Part I
Single-pass incremental and interactive mining for weighted frequent patterns

Expert Systems with Applications: An International Journal
Retrieval in CBR using a combination of similarity and association knowledge

ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part I
Mining frequent patterns and association rules using similarities

Expert Systems with Applications: An International Journal
Clustering Software Components for Component Reuse and Program Restructuring

Proceedings of the Second International Conference on Innovative Computing and Cloud Computing
Efficient frequent pattern mining based on Linear Prefix tree

Knowledge-Based Systems
Discovering diverse-frequent patterns in transactional databases

Proceedings of the 17th International Conference on Management of Data

Quantified Score

Hi-index	0.08

Visualization

Abstract

The use of frequent itemsets has been limited by the high computational cost as well as the large number of resulting itemsets. In many real-world scenarios, however, it is often sufficient to mine a small representative subset of frequent itemsets with low computational cost. To that end, in this paper, we define a new problem of finding the frequent itemsets with a maximum length and present a novel algorithm to solve this problem. Indeed, maximum length frequent itemsets can be efficiently identified in very large data sets and are useful in many application domains. Our algorithm generates the maximum length frequent itemsets by adapting a pattern fragment growth methodology based on the FP-tree structure. Also, a number of optimization techniques have been exploited to prune the search space. Finally, extensive experiments on real-world data sets validate the proposed algorithm.