Algorithms for clustering data
Algorithms for clustering data
A New Nonparametric Pairwise Clustering Algorithm Based on Iterative Estimation of Distance Profiles
Machine Learning - Special issue: Unsupervised learning
The Journal of Machine Learning Research
Hi-index | 0.14 |
Clustering algorithms have the annoying characteristic of finding clusters in random data. A theoretical analysis of the threshold of the mutual neighborhood clustering algorithm (MNCA) under the hypothesis of random data is presented. This yields a theoretical minimum value of this threshold below which even unclustered data are broken into separate clusters. To derive the threshold, a theorem about mutual near neighbors in a Poisson process is stated and proved. Simple experiments demonstrate the usefulness of the theoretical thresholds.