BIRCH: an efficient data clustering method for very large databases
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Knowledge Acquisition Via Incremental Conceptual Clustering
Machine Learning
Integration of Data Mining with Database Technology
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Hi-index | 0.01 |
We have developed a clustering algorithm called CLIMIS to demonstrate the advantages of implementing a data mining algorithm in a database management system (DBMS). CLIMIS clusters data held in a DBMS, stores the resulting clusters in the DBMS and executes inside the DBMS. By tightly coupling CLIMIS with the database environment the algorithm scales better to large databases. This is achieved through an index-like structure that uses the database to overcome memory limitations. We further improve the performance of the algorithm by using a technique called adaptive clustering, which controls the size of the clusters.