Applied multivariate techniques
Applied multivariate techniques
CURE: an efficient clustering algorithm for large databases
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
ACM Computing Surveys (CSUR)
Clustering validity checking methods: part II
ACM SIGMOD Record
On Clustering Validation Techniques
Journal of Intelligent Information Systems
Clustering Validity Assessment: Finding the Optimal Partitioning of a Data Set
ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Pattern Recognition, Third Edition
Pattern Recognition, Third Edition
FIE'09 Proceedings of the 39th IEEE international conference on Frontiers in education conference
Record linkage with uniqueness constraints and erroneous values
Proceedings of the VLDB Endowment
Best clustering configuration metrics: towards multiagent based clustering
ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications: Part I
Linking records in dynamic world
PhD '12 Proceedings of the on SIGMOD/PODS 2012 PhD Symposium
Neighborhood-Based smoothing of external cluster validity measures
PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Similar or not similar: this is a parameter question
HCI International'13 Proceedings of the 15th international conference on Human Interface and the Management of Information: information and interaction design - Volume Part I
Hi-index | 0.00 |
Clustering is a process of discovering groups of objects such that the objects of the same group are similar, and the objects belonging to different groups are dissimilar. Several research fields deal with the problem of clustering: for example pattern recognition, data mining, machine learning. A number of algorithms exist that can solve the problem of clustering, but most of them are very sensitive to their input parameters. Therefore it is very important to evaluate the result of the clustering algorithms. It is difficult to define whether a clustering result is acceptable or not, thus several clustering validity techniques and indices have been developed. This paper deals with the problem of clustering validity. The most commonly used validity indices are introduced and explained, and they are compared based on experimental results.