Efficient Mining of Association Rules in Distributed Databases

Authors:
David W. Cheung;Vincent T. Ng;Ada W. Fu;Yongjian Fu
Affiliations:
-;-;-;-
Venue:
IEEE Transactions on Knowledge and Data Engineering
Year:
1996

Citing 20
Cited 84

Principles of database and knowledge-base systems, Vol. I

Principles of database and knowledge-base systems, Vol. I
Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Finding interesting rules from large sets of discovered association rules

CIKM '94 Proceedings of the third international conference on Information and knowledge management
PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing

PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing
Efficient parallel data mining for association rules

CIKM '95 Proceedings of the fourth international conference on Information and knowledge management
An effective hash-based algorithm for mining association rules

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Advances in knowledge discovery and data mining

Advances in knowledge discovery and data mining
Knowledge Discovery in Databases

Knowledge Discovery in Databases
Parallel Mining of Association Rules

IEEE Transactions on Knowledge and Data Engineering
Data-Driven Discovery of Quantitative Rules in Relational Databases

IEEE Transactions on Knowledge and Data Engineering
Database Mining: A Performance Perspective

IEEE Transactions on Knowledge and Data Engineering
Efficient Similarity Search In Sequence Databases

FODO '93 Proceedings of the 4th International Conference on Foundations of Data Organization and Algorithms
Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique

ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
An Interval Classifier for Database Mining Applications

VLDB '92 Proceedings of the 18th International Conference on Very Large Data Bases
Efficient and Effective Clustering Methods for Spatial Data Mining

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Discovery of Multiple-Level Association Rules from Large Databases

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
An Efficient Algorithm for Mining Association Rules in Large Databases

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Mining Generalized Association Rules

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
A Case-Based Reasoning Approach for Associative Query Answering

ISMIS '94 Proceedings of the 8th International Symposium on Methodologies for Intelligent Systems

Association rules over interval data

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
A localized algorithm for parallel association mining

Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures
Parallel mining algorithms for generalized association rules with classification hierarchy

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Using incremental pruning to increase the efficiency of dynamic itemset counting for mining association rules

Proceedings of the seventh international conference on Information and knowledge management
The application of association rule mining to remotely sensed data

SAC '00 Proceedings of the 2000 ACM symposium on Applied computing - Volume 1
High performance data mining (tutorial PM-3)

Tutorial notes of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Beyond intratransaction association analysis: mining multidimensional intertransaction association rules

ACM Transactions on Information Systems (TOIS)
Distributed, Collaborative Data Analysis from Heterogeneous Sites Using a Scalable Evolutionary Technique

Applied Intelligence
Parallel Algorithms for Discovery of Association Rules

Data Mining and Knowledge Discovery
Scalable Parallel Data Mining for Association Rules

IEEE Transactions on Knowledge and Data Engineering
Effect of Data Skewness and Workload Balance in Parallel Data Mining

IEEE Transactions on Knowledge and Data Engineering
Exploiting Data Mining Techniques for Broadcasting Data in Mobile Computing Environments

IEEE Transactions on Knowledge and Data Engineering
Distributed mining of classification rules

Knowledge and Information Systems
Developing Data Allocation Schemes by Incremental Mining of User Moving Patterns in a Mobile Computing System

IEEE Transactions on Knowledge and Data Engineering
Synthesizing High-Frequency Rules from Different Data Sources

IEEE Transactions on Knowledge and Data Engineering
A Data Mining Architecture for Distributed Environments

IICS '02 Proceedings of the Second International Workshop on Innovative Internet Computing Systems
Data Mining the Yeast Genome in a Lazy Functional Language

PADL '03 Proceedings of the 5th International Symposium on Practical Aspects of Declarative Languages
First Experiments for Mining Sequential Patterns on Distributed Sites with Multi-Agents

IDEAL '00 Proceedings of the Second International Conference on Intelligent Data Engineering and Automated Learning, Data Mining, Financial Engineering, and Intelligent Agents
Frequent Itemset Counting Across Multiple Tables

PADKK '00 Proceedings of the 4th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Current Issues and New Applications
Density-Based Mining of Quantitative Association Rules

PADKK '00 Proceedings of the 4th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Current Issues and New Applications
Inducing Load Balancing and Efficient Data Distribution Prior to Association Rule Discovery in a Parallel Environment

Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
Efficient Parallel Algorithms for Mining Associations

Revised Papers from Large-Scale Parallel Data Mining, Workshop on Large-Scale Parallel KDD Systems, SIGKDD
Parallel Sequence Mining on Shared-Memory Machines

Revised Papers from Large-Scale Parallel Data Mining, Workshop on Large-Scale Parallel KDD Systems, SIGKDD
Parallel Generalized Association Rule Mining on Large Scale PC Cluster

Revised Papers from Large-Scale Parallel Data Mining, Workshop on Large-Scale Parallel KDD Systems, SIGKDD
An Efficient Distributed Algorithm for Computing Association Rules

WAIM '00 Proceedings of the First International Conference on Web-Age Information Management
A template model for multidimensional inter-transactional association rules

The VLDB Journal — The International Journal on Very Large Data Bases
Privacy preserving association rule mining in vertically partitioned data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Connectionist and evolutionary models for learning, discovering and forecasting software effort

Managing data mining technologies in organizations
Mining User Moving Patterns for Personal Data Allocation in a Mobile Computing System

ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
Capturing User Access Patterns in the Web for Data Mining

ICTAI '99 Proceedings of the 11th IEEE International Conference on Tools with Artificial Intelligence
Itemset Trees for Targeted Association Querying

IEEE Transactions on Knowledge and Data Engineering
A new distributed data mining model based on similarity

Proceedings of the 2003 ACM symposium on Applied computing
Distributed cooperative mining for information consortia

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Privacy-Preserving Distributed Mining of Association Rules on Horizontally Partitioned Data

IEEE Transactions on Knowledge and Data Engineering
An efficient strategy for mining exceptions in multi-databases

Information Sciences: an International Journal
Database classification for multi-database mining

Information Systems
From intra-transaction to generalized inter-transaction: landscaping multidimensional contexts in association rule mining

Information Sciences—Informatics and Computer Science: An International Journal
Data mining with the SAP NetWeaver BI accelerator

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Partitioning strategies for distributed association rule mining

The Knowledge Engineering Review
Association rules mining in vertically partitioned databases

Data & Knowledge Engineering - Special issue: WIDM 2004
Parallel mining of association rules from text databases

The Journal of Supercomputing
Association mining in time-varying domains

Intelligent Data Analysis
Searching for high-support itemsets in itemset trees

Intelligent Data Analysis
Secure set intersection cardinality with application to association rule mining

Journal of Computer Security
Synthesizing heavy association rules from different real data sources

Pattern Recognition Letters
Mining association rules from imprecise ordinal data

Fuzzy Sets and Systems
Privacy-preserving multi-party decision tree induction

International Journal of Business Intelligence and Data Mining
Distributed and Shared Memory Algorithm for Parallel Mining of Association Rules

MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Fast Cryptographic Privacy Preserving Association Rules Mining on Distributed Homogenous Data Base

KES '08 Proceedings of the 12th international conference on Knowledge-Based Intelligent Information and Engineering Systems, Part II
An efficient distributed algorithm for canonical labeling on directed split-stars

Discrete Applied Mathematics
Mining fuzzy association rules from questionnaire data

Knowledge-Based Systems
Multirelational classification: a multiple view approach

Knowledge and Information Systems
Mining globally interesting patterns from multiple databases using kernel estimation

Expert Systems with Applications: An International Journal
TidFP: Mining Frequent Patterns in Different Databases with Transaction ID

DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
A load-balanced distributed parallel mining algorithm

Expert Systems with Applications: An International Journal
An efficient algorithm for finding dense regions for mining quantitative association rules

Computers & Mathematics with Applications
From intra-transaction to generalized inter-transaction: Landscaping multidimensional contexts in association rule mining

Information Sciences: an International Journal
A compress-based association mining algorithm for large dataset

ICCS'03 Proceedings of the 2003 international conference on Computational science
Research of distributed data mining association rules model based on similarity

HCI'07 Proceedings of the 12th international conference on Human-computer interaction: applications and services
Performance study of distributed Apriori-like frequent itemsets mining

Knowledge and Information Systems
A rough set approach to mining connections from information systems

Proceedings of the 2010 ACM Symposium on Applied Computing
Protecting privacy in incremental maintenance for distributed association rule mining

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Association rule mining: models and algorithms

Association rule mining: models and algorithms
Research on multi-dimensional association rules mining in distributed environments based on advanced SQL query

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 2
Toward boosting distributed association rule mining by data de-clustering

Information Sciences: an International Journal
Mining fuzzy association rules from uncertain data

Knowledge and Information Systems
POTMiner: mining ordered, unordered, and partially-ordered trees

Knowledge and Information Systems
Mining frequent patterns from XML data: Efficient algorithms and design trade-offs

Expert Systems with Applications: An International Journal
CLAP: Collaborative pattern mining for distributed information systems

Decision Support Systems
Mining interesting XML-enabled association rules with templates

KDID'04 Proceedings of the Third international conference on Knowledge Discovery in Inductive Databases
A privacy preserving mining algorithm on distributed dataset

FSKD'06 Proceedings of the Third international conference on Fuzzy Systems and Knowledge Discovery
Distributed pattern discovery in multiple streams

PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Rule mining for dynamic databases

IWDC'04 Proceedings of the 6th international conference on Distributed Computing
Efficient classification from multiple heterogeneous databases

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
Development of a soldering quality classifier system using a hybrid data mining approach

Expert Systems with Applications: An International Journal
Normalised support: a virtual angle of measurement of 'interestingness'

International Journal of Data Analysis Techniques and Strategies
Mining global association rules on an oracle grid by scanning once distributed databases

Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Scalable inductive learning on partitioned data

ISMIS'05 Proceedings of the 15th international conference on Foundations of Intelligent Systems
Rule synthesizing from multiple related databases

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
An efficient distributed algorithm for mining association rules

ISPA'06 Proceedings of the 4th international conference on Parallel and Distributed Processing and Applications
A multi-agent data mining system for cartel detection in Brazilian government procurement

Expert Systems with Applications: An International Journal
Collusion-Free Privacy Preserving Data Mining

International Journal of Intelligent Information Technologies
Clustering local frequency items in multiple databases

Information Sciences: an International Journal
Predicting re-hospitalisations using intelligent systems: an exploratory study

International Journal of Business Information Systems

Quantified Score

Hi-index	0.01

Visualization

Abstract

Many sequential algorithms have been proposed for mining of association rules. However, very little work has been done in mining association rules in distributed databases. A direct application of sequential algorithms to distributed databases is not effective, because it requires a large amount of communication overhead. In this study, an efficient algorithm, DMA, is proposed. It generates a small number of candidate sets and requires only O(n) messages for support count exchange for each candidate set, where n is the number of sites in a distributed database. The algorithm has been implemented on an experimental test bed and its performance is studied. The results show that DMA has superior performance when comparing with the direct application of a popular sequential algorithm in distributed databases.