Efficient clustering of databases induced by local patterns

  • Authors:
  • Animesh Adhikari;P. R. Rao

  • Affiliations:
  • Department of Computer Science, S P Chowgule College, Margao, Goa 403 602, India;Department of Computer Science and Technology, Goa University, Goa 403 206, India

  • Venue:
  • Decision Support Systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.01

Visualization

Abstract

Many large organizations have multiple large databases as they transact from multiple branches. Most of the previous pieces of work are based on a single database. Thus, it is necessary to study data mining on multiple databases. In this paper, we propose two measures of similarity between a pair of databases. Also, we propose an algorithm for clustering a set of databases. Efficiency of the clustering process has been improved using the following strategies: reducing execution time of clustering algorithm, using more appropriate similarity measure, and storing frequent itemsets space efficiently.