A Validity Measure for Fuzzy Clustering
IEEE Transactions on Pattern Analysis and Machine Intelligence
A robust algorithm for automatic extraction of an unknown number of clusters from noisy data
Pattern Recognition Letters
On finding the number of clusters
Pattern Recognition Letters
ACM Computing Surveys (CSUR)
Fuzzy Sets and Systems - Clustering and modeling
A cluster validity index for fuzzy clustering
Pattern Recognition Letters
New indices for cluster validity assessment
Pattern Recognition Letters
An objective approach to cluster validation
Pattern Recognition Letters
Iterative shrinking method for clustering problems
Pattern Recognition
On fuzzy cluster validity indices
Fuzzy Sets and Systems
Validation criteria for enhanced fuzzy clustering
Pattern Recognition Letters
A cluster validity index for fuzzy clustering
Information Sciences: an International Journal
Numerical methods for fuzzy clustering
Information Sciences: an International Journal
A note on the Gustafson-Kessel and adaptive fuzzy clustering algorithms
IEEE Transactions on Fuzzy Systems
On cluster validity for the fuzzy c-means model
IEEE Transactions on Fuzzy Systems
Validity index for clusters of different sizes and densities
Pattern Recognition Letters
Partitive clustering (K-means family)
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
FUAT - A fuzzy clustering analysis tool
Expert Systems with Applications: An International Journal
Hi-index | 0.01 |
Cluster validity indices are used for estimating the quality of partitions produced by clustering algorithms and for determining the number of clusters in data. Cluster validation is difficult task, because for the same data set more partitions exists regarding the level of details that fit natural groupings of a given data set. Even though several cluster validity indices exist, they are inefficient when clusters widely differ in density or size. We propose a clustering validity index that addresses these issues. It is based on compactness and overlap measures. The overlap measure, which indicates the degree of overlap between fuzzy clusters, is obtained by calculating the overlap rate of all data objects that belong strongly enough to two or more clusters. The compactness measure, which indicates the degree of similarity of data objects in a cluster, is calculated from membership values of data objects that are strongly enough associated to one cluster. We propose ratio and summation type of index using the same compactness and overlap measures. The maximal value of index denotes the optimal fuzzy partition that is expected to have a high compactness and a low degree of overlap among clusters. Testing many well-known previously formulated and proposed indices on well-known data sets showed the superior reliability and effectiveness of the proposed index in comparison to other indices especially when evaluating partitions with clusters that widely differ in size or density.