Mining association rules between sets of items in large databases
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
ACM Computing Surveys (CSUR)
Introduction to data compression (2nd ed.)
Introduction to data compression (2nd ed.)
Sliding-window filtering: an efficient algorithm for incremental mining
Proceedings of the tenth international conference on Information and knowledge management
BIRCH: A New Data Clustering Algorithm and Its Applications
Data Mining and Knowledge Discovery
Toward Multidatabase Mining: Identifying Relevant Databases
IEEE Transactions on Knowledge and Data Engineering
Selecting the right interestingness measure for association patterns
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Dynamic sample selection for approximate query processing
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Knowledge Discovery in Multiple Databases
Knowledge Discovery in Multiple Databases
Unsupervised clustering for nontextual web document classification
Decision Support Systems
Database classification for multi-database mining
Information Systems
Building knowledge discovery-driven models for decision support in project management
Decision Support Systems
Elements of discrete mathematics (McGraw-Hill computer science series)
Elements of discrete mathematics (McGraw-Hill computer science series)
Market basket analysis in a multiple store environment
Decision Support Systems
A logical framework for identifying quality knowledge from different data sources
Decision Support Systems
Efficient classification from multiple heterogeneous databases
PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
Capturing association among items in a database
Data & Knowledge Engineering
Measuring influence of an item in a database over time
Pattern Recognition Letters
On the effectiveness of distributed learning on different class-probability distributions of data
CAEPIA'11 Proceedings of the 14th international conference on Advances in artificial intelligence: spanish association for artificial intelligence
Clustering local frequency items in multiple databases
Information Sciences: an International Journal
A Framework for Synthesizing Arbitrary Boolean Queries Induced by Frequent Itemsets
International Journal of Knowledge-Based Organizations
Hi-index | 0.01 |
Many large organizations have multiple large databases as they transact from multiple branches. Most of the previous pieces of work are based on a single database. Thus, it is necessary to study data mining on multiple databases. In this paper, we propose two measures of similarity between a pair of databases. Also, we propose an algorithm for clustering a set of databases. Efficiency of the clustering process has been improved using the following strategies: reducing execution time of clustering algorithm, using more appropriate similarity measure, and storing frequent itemsets space efficiently.