Cluster validity index for estimation of fuzzy clusters of different sizes and densities

Authors:
Krista Rizman alik
Affiliations:
University of Maribor, Faculty of Electrical Engineering and Computer Science, Smetanova 17, SI-2000 Maribor, Slovenia
Venue:
Pattern Recognition
Year:
2010

Citing 15
Cited 4

A Validity Measure for Fuzzy Clustering

IEEE Transactions on Pattern Analysis and Machine Intelligence
A robust algorithm for automatic extraction of an unknown number of clusters from noisy data

Pattern Recognition Letters
On finding the number of clusters

Pattern Recognition Letters
Data clustering: a review

ACM Computing Surveys (CSUR)
A fuzzy hybrid hierarchical clustering method with a new criterion able to find the optimal partition

Fuzzy Sets and Systems - Clustering and modeling
A cluster validity index for fuzzy clustering

Pattern Recognition Letters
New indices for cluster validity assessment

Pattern Recognition Letters
An objective approach to cluster validation

Pattern Recognition Letters
Iterative shrinking method for clustering problems

Pattern Recognition
On fuzzy cluster validity indices

Fuzzy Sets and Systems
Validation criteria for enhanced fuzzy clustering

Pattern Recognition Letters
A cluster validity index for fuzzy clustering

Information Sciences: an International Journal
Numerical methods for fuzzy clustering

Information Sciences: an International Journal
A note on the Gustafson-Kessel and adaptive fuzzy clustering algorithms

IEEE Transactions on Fuzzy Systems
On cluster validity for the fuzzy c-means model

IEEE Transactions on Fuzzy Systems

Validity index for clusters of different sizes and densities

Pattern Recognition Letters
Partitive clustering (K-means family)

Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
FUAT - A fuzzy clustering analysis tool

Expert Systems with Applications: An International Journal
Study on multi-center fuzzy C-means algorithm based on transitive closure and spectral clustering

Applied Soft Computing

Quantified Score

Hi-index	0.01

Visualization

Abstract

Cluster validity indices are used for estimating the quality of partitions produced by clustering algorithms and for determining the number of clusters in data. Cluster validation is difficult task, because for the same data set more partitions exists regarding the level of details that fit natural groupings of a given data set. Even though several cluster validity indices exist, they are inefficient when clusters widely differ in density or size. We propose a clustering validity index that addresses these issues. It is based on compactness and overlap measures. The overlap measure, which indicates the degree of overlap between fuzzy clusters, is obtained by calculating the overlap rate of all data objects that belong strongly enough to two or more clusters. The compactness measure, which indicates the degree of similarity of data objects in a cluster, is calculated from membership values of data objects that are strongly enough associated to one cluster. We propose ratio and summation type of index using the same compactness and overlap measures. The maximal value of index denotes the optimal fuzzy partition that is expected to have a high compactness and a low degree of overlap among clusters. Testing many well-known previously formulated and proposed indices on well-known data sets showed the superior reliability and effectiveness of the proposed index in comparison to other indices especially when evaluating partitions with clusters that widely differ in size or density.