Tendency curves for visual clustering assessment
ACC'08 Proceedings of the WSEAS International Conference on Applied Computing Conference
An algorithm for clustering tendency assessment
WSEAS Transactions on Mathematics
Is VAT really single linkage in disguise?
Annals of Mathematics and Artificial Intelligence
Fuzzy PCA-guided robust k-means clustering
IEEE Transactions on Fuzzy Systems
WCCI'08 Proceedings of the 2008 IEEE world conference on Computational intelligence: research frontiers
Relational generalizations of cluster validity indices
IEEE Transactions on Fuzzy Systems
Clustering ellipses for anomaly detection
Pattern Recognition
ICAISC'10 Proceedings of the 10th international conference on Artificial intelligence and soft computing: Part I
International Journal of Knowledge Engineering and Soft Data Paradigms
Relational duals of cluster-validity functions for the c-means family
IEEE Transactions on Fuzzy Systems
Tuning graded possibilistic clustering by visual stability analysis
WILF'11 Proceedings of the 9th international conference on Fuzzy logic and applications
iVAT and aVAT: enhanced visual analysis for cluster tendency assessment
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
International Journal of Intelligent Systems
Objective function-based clustering
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Discovering inherent event taxonomies from social media collections
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Hi-index | 0.00 |
We have an m times n matrix D, and assume that its entries correspond to pair wise dissimilarities between m row objects Or and n column objects Oc, which, taken together (as a union), comprise a set O of N = m + n objects. This paper develops a new visual approach that applies to four different cluster assessment problems associated with O. The problems are the assessment of cluster tendency: PI) amongst the row objects Or; P2) amongst the column objects Oc; P3) amongst the union of the row and column objects Or U Oc; and P4) amongst the union of the row and column objects that contain at least one object of each type (co-clusters). The basis of the method is to regard D as a subset of known values that is part of a larger, unknown N times N dissimilarity matrix, and then impute the missing values from D. This results in estimates for three square matrices (Dr, Dc, DrUc) that can be visually assessed for clustering tendency using the previous VAT or sVAT algorithms. The output from assessment of DrUc ultimately leads to a rectangular coVAT image which exhibits clustering tendencies in D. Five examples are given to illustrate the new method. Two important points: i) because VAT is scalable by sVAT to data sets of arbitrary size, and because coVAT depends explicitly (and only) on VAT, this new approach is immediately scalable to, say, the scoVAT model, which works for even very large (unloadable) data sets without alteration; and ii) VAT, sVAT and coVAT are autonomous, parameter free models - no "hidden values" are needed to make them work.