On the strength of hyperclique patterns for text categorization
Information Sciences: an International Journal
Correlation search in graph databases
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Discovery of maximum length frequent itemsets
Information Sciences: an International Journal
Correlated pattern mining in quantitative databases
ACM Transactions on Database Systems (TODS)
Hypergraph partitioning for document clustering: a unified clique perspective
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Selecting the Right Features for Bipartite-Based Text Clustering
ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Efficient Single-Pass Mining of Weighted Interesting Patterns
AI '08 Proceedings of the 21st Australasian Joint Conference on Artificial Intelligence: Advances in Artificial Intelligence
PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Mining Mutually Dependent Ordered Subtrees in Tree Databases
New Frontiers in Applied Data Mining
Handling Dynamic Weights in Weighted Frequent Pattern Mining
IEICE - Transactions on Information and Systems
An Improved Algorithm for Mining Non-Redundant Interacting Feature Subsets
APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Association Analysis Techniques for Bioinformatics Problems
BICoB '09 Proceedings of the 1st International Conference on Bioinformatics and Computational Biology
Issues in pattern mining and their resolutions
C3S2E '09 Proceedings of the 2nd Canadian Conference on Computer Science and Software Engineering
Discovery of Correlated Sequential Subgraphs from a Sequence of Graphs
ADMA '09 Proceedings of the 5th International Conference on Advanced Data Mining and Applications
Semantic feature selection for object discovery in high-resolution remote sensing imagery
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Term weighting evaluation in bipartite partitioning for text clustering
AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
Mining correlated subgraphs in graph databases
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Validation of overlapping clustering: A random clustering perspective
Information Sciences: an International Journal
A statistical interestingness measures for XML based association rules
PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
Scaling up top-K cosine similarity search
Data & Knowledge Engineering
DS'10 Proceedings of the 13th international conference on Discovery science
HUC-Prune: an efficient candidate pruning technique to mine high utility patterns
Applied Intelligence
Cosine interesting pattern discovery
Information Sciences: an International Journal
Single-pass incremental and interactive mining for weighted frequent patterns
Expert Systems with Applications: An International Journal
Interactive mining of high utility patterns over data streams
Expert Systems with Applications: An International Journal
New exact concise representation of rare correlated patterns: application to intrusion detection
PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
CGStream: continuous correlated graph query for data streams
Proceedings of the 21st ACM international conference on Information and knowledge management
Mining popular patterns from transactional databases
DaWaK'12 Proceedings of the 14th international conference on Data Warehousing and Knowledge Discovery
ShrFP-tree: an efficient tree structure for mining share-frequent patterns
AusDM '08 Proceedings of the 7th Australasian Data Mining Conference - Volume 87
Comparative study of text clustering techniques in virtual worlds
Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics
Mining association rules with rare and frequent items
International Journal of Knowledge Engineering and Data Mining
Editorial: data mining in electronic commerce - support vs. confidence
Journal of Theoretical and Applied Electronic Commerce Research
Scaling up cosine interesting pattern discovery: A depth-first method
Information Sciences: an International Journal
Hi-index | 0.00 |
Existing algorithms for mining association patterns often rely on the support-based pruning strategy to prune a combinatorial search space. However, this strategy is not effective for discovering potentially interesting patterns at low levels of support. Also, it tends to generate too many spurious patterns involving items which are from different support levels and are poorly correlated. In this paper, we present a framework for mining highly-correlated association patterns called hyperclique patterns. In this framework, an objective measure called h-confidence is applied to discover hyperclique patterns. We prove that the items in a hyperclique pattern have a guaranteed level of global pairwise similarity to one another as measured by the cosine similarity (uncentered Pearson's correlation coefficient). Also, we show that the h-confidence measure satisfies a cross-support property which can help efficiently eliminate spurious patterns involving items with substantially different support levels. Indeed, this cross-support property is not limited to h-confidence and can be generalized to some other association measures. In addition, an algorithm called hyperclique miner is proposed to exploit both cross-support and anti-monotone properties of the h-confidence measure for the efficient discovery of hyperclique patterns. Finally, our experimental results show that hyperclique miner can efficiently identify hyperclique patterns, even at extremely low levels of support.