Automatic Cluster Number Selection Using a Split and Merge K-Means Approach

Authors:
Markus Muhr;Michael Granitzer
Venue:
DEXA '09 Proceedings of the 2009 20th International Workshop on Database and Expert Systems Application
Year:
2009

Abstract

The k-means method is a simple and fast clustering technique, but it requires the number of clusters to be specified in advance. We address the problem of cluster number selection with a k-means approach that exploits local changes in internal validity indices to split or merge clusters. Our split and merge k-means uses criterion functions to select the clusters to be split or merged, and fitness assessments to evaluate the resulting changes in cluster structure. Experiments on standard test data sets show that this approach selects an accurate number of clusters with reasonable runtime and accuracy.
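
The general idea of growing and shrinking a k-means solution can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not name the validity indices, criterion functions, or fitness assessments used, so everything below is an assumption. In particular, a BIC-style penalised squared-error score (`cost`) stands in for the internal validity index, a farthest-point 2-means bisection (`split`) stands in for the split criterion, and every pairwise merge is tried exhaustively. All function names are hypothetical, and the input is assumed to be a non-empty list of numeric tuples.

```python
import math

def centroid(points):
    """Component-wise mean of a non-empty list of equal-length tuples."""
    return tuple(sum(c) / len(points) for c in zip(*points))

def dist2(a, b):
    """Squared Euclidean distance."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def lloyd(points, centers, iters=50):
    """Plain k-means (Lloyd iterations) from given centers; empty clusters are dropped."""
    clusters = []
    for _ in range(iters):
        groups = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: dist2(p, centers[i]))
            groups[nearest].append(p)
        clusters = [g for g in groups if g]
        new_centers = [centroid(g) for g in clusters]
        if new_centers == centers:
            break
        centers = new_centers
    return clusters

def cost(clusters):
    """Assumed stand-in validity index (lower is better): fit term plus per-cluster penalty."""
    n = sum(len(c) for c in clusters)
    dim = len(clusters[0][0])
    sse = sum(dist2(p, m) for c in clusters for m in [centroid(c)] for p in c)
    return n * math.log(sse / n + 1e-9) + len(clusters) * (dim + 1) * math.log(n)

def split(cluster):
    """Bisect one cluster with 2-means, seeded by two mutually distant points."""
    m = centroid(cluster)
    a = max(cluster, key=lambda p: dist2(p, m))
    b = max(cluster, key=lambda p: dist2(p, a))
    return lloyd(cluster, [a, b])

def split_merge_kmeans(points, max_rounds=100):
    """Start from one cluster; accept the best single split or merge while the score improves."""
    clusters = [list(points)]
    for _ in range(max_rounds):
        candidates = []
        for i, c in enumerate(clusters):
            rest = clusters[:i] + clusters[i + 1:]
            if len(c) >= 2:
                candidates.append(rest + split(c))          # local split of cluster i
            for j in range(i + 1, len(clusters)):
                others = [x for t, x in enumerate(clusters) if t not in (i, j)]
                candidates.append(others + [c + clusters[j]])  # local merge of i and j
        best = min(candidates, key=cost, default=clusters)
        if cost(best) >= cost(clusters):
            break  # no local change improves the index
        # Refine the accepted structure with a global k-means pass.
        clusters = lloyd(points, [centroid(c) for c in best])
    return clusters

# Toy data: three well-separated groups of nine points each.
offsets = [(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1),
           (0.5, 0.5), (0.5, -0.5), (-0.5, 0.5), (-0.5, -0.5)]
points = [(cx + dx, cy + dy)
          for cx, cy in [(0, 0), (10, 0), (0, 10)]
          for dx, dy in offsets]
clusters = split_merge_kmeans(points)
print(len(clusters), sorted(len(c) for c in clusters))
```

On this toy data the sketch stops at three clusters without being given k. The penalty term in `cost` is what keeps it from splitting further; a different index would generally lead to different stopping behaviour.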