Efficient clustering of databases induced by local patterns

Authors:
Animesh Adhikari; P. R. Rao
Affiliations:
Department of Computer Science, S P Chowgule College, Margao, Goa 403 602, India; Department of Computer Science and Technology, Goa University, Goa 403 206, India
Venue:
Decision Support Systems
Year:
2008

Abstract

Many large organizations transact from multiple branches and therefore maintain multiple large databases. Most previous work on data mining is based on a single database, so it is necessary to study data mining on multiple databases. In this paper, we propose two measures of similarity between a pair of databases, and an algorithm for clustering a set of databases. The efficiency of the clustering process is improved using three strategies: reducing the execution time of the clustering algorithm, using a more appropriate similarity measure, and storing frequent itemsets space-efficiently.