Mining quantitative associations in large database

Authors:
Chenyong Hu;Yongji Wang;Benyu Zhang;Qiang Yang;Qing Wang;Jinhui Zhou;Ran He;Yun Yan
Affiliations:
Lab for Internet Software Technologies, Institute of Software, Chinese Academy of Sciences, Beijing;Lab for Internet Software Technologies, Institute of Software, Chinese Academy of Sciences, Beijing;Microsoft Research Asia, Beijing, P.R. China;Department of Computer Science, Hong Kong University of Science and Technology;Lab for Internet Software Technologies, Institute of Software, Chinese Academy of Sciences, Beijing;Lab for Internet Software Technologies, Institute of Software, Chinese Academy of Sciences, Beijing;Lab for Internet Software Technologies, Institute of Software, Chinese Academy of Sciences, Beijing;LMAM, Department of Information Science, School of Mathematical Science, Peking University, Beijing
Venue:
APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
Year:
2005

Citing 15
Cited 0

Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Mining quantitative association rules in large relational tables

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
A statistical theory for quantitative association rules

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Parallel Algorithms for Discovery of Association Rules

Data Mining and Knowledge Discovery
Mining Association Rules: Anti-Skew Algorithms

ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
CMAR: Accurate and Efficient Classification Based on Multiple Class-Association Rules

ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Discovery of Multiple-Level Association Rules from Large Databases

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Mining Associations by Pattern Structure in Large Relational Tables

ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
Mining Association Rules from Stars

ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
Facilitating Fuzzy Association Rules Mining by Using Multi-Objective Genetic Algorithms for Automated Clustering

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Mining Frequent Itemsets in Distributed and Dynamic Databases

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
The Rough Set Approach to Association Rule Mining

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
A High-Performance Distributed Algorithm for Mining Association Rules

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Association Rule Mining in Peer-to-Peer Systems

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

Association Rule Mining algorithms operate on a data matrix to derive association rule, discarding the quantities of the items, which contains valuable information. In order to make full use of the knowledge inherent in the quantities of the items, an extension named Ratio Rules [6] is proposed to capture the quantitative association. However, the approach, which is addressed in [6], is mainly based on Principle Component Analysis (PCA) and as a result, it cannot guarantee that the ratio coefficient is non-negative. This may lead to serious problems in the association rules' application. In this paper, a new method, called Principal Non-negative Sparse Coding (PNSC), is provided for learning the associations between itemsets in the form of Ratio Rules. Experiments on several datasets illustrate that the proposed method performs well for the purpose of discovering latent associations between itemsets in large datasets.