Visual Assessment of Clustering Tendency for Rectangular Dissimilarity Matrices

Authors:
J. C. Bezdek;R. J. Hathaway;J. M. Huband
Affiliations:
Univ. of West Florida, Pensacola;-;-
Venue:
IEEE Transactions on Fuzzy Systems
Year:
2007

Citing 0
Cited 15

Tendency curves for visual clustering assessment
ACC'08 Proceedings of the WSEAS International Conference on Applied Computing Conference

An algorithm for clustering tendency assessment
WSEAS Transactions on Mathematics

Is VAT really single linkage in disguise?
Annals of Mathematics and Artificial Intelligence

Fuzzy PCA-guided robust k-means clustering
IEEE Transactions on Fuzzy Systems

VCV2: visual cluster validity
WCCI'08 Proceedings of the 2008 IEEE world conference on Computational intelligence: research frontiers

Relational generalizations of cluster validity indices
IEEE Transactions on Fuzzy Systems

Clustering ellipses for anomaly detection
Pattern Recognition

A new implementation of the co-VAT algorithm for visual assessment of clusters in rectangular relational data
ICAISC'10 Proceedings of the 10th international conference on Artificial intelligence and soft computing: Part I

Collaborative filtering by sequential user-item co-cluster extraction from rectangular relational data
International Journal of Knowledge Engineering and Soft Data Paradigms

Relational duals of cluster-validity functions for the c-means family
IEEE Transactions on Fuzzy Systems

Tuning graded possibilistic clustering by visual stability analysis
WILF'11 Proceedings of the 9th international conference on Fuzzy logic and applications

iVAT and aVAT: enhanced visual analysis for cluster tendency assessment
PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I

A new formulation of the coVAT algorithm for visual assessment of clustering tendency in rectangular data
International Journal of Intelligent Systems

Objective function-based clustering
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery

Discovering inherent event taxonomies from social media collections
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval

Abstract

We have an m × n matrix D, and assume that its entries correspond to pairwise dissimilarities between m row objects Or and n column objects Oc, which, taken together (as a union), comprise a set O of N = m + n objects. This paper develops a new visual approach that applies to four different cluster assessment problems associated with O. The problems are the assessment of cluster tendency: P1) amongst the row objects Or; P2) amongst the column objects Oc; P3) amongst the union of the row and column objects Or ∪ Oc; and P4) amongst the union of the row and column objects, in groups that contain at least one object of each type (co-clusters). The basis of the method is to regard D as a subset of known values that is part of a larger, unknown N × N dissimilarity matrix, and then impute the missing values from D. This results in estimates for three square matrices (Dr, Dc, Dr∪c) that can be visually assessed for clustering tendency using the previous VAT or sVAT algorithms. The output from assessment of Dr∪c ultimately leads to a rectangular coVAT image which exhibits clustering tendencies in D. Five examples are given to illustrate the new method. Two important points: i) because VAT is scalable by sVAT to data sets of arbitrary size, and because coVAT depends explicitly (and only) on VAT, this new approach is immediately scalable to, say, the scoVAT model, which works for even very large (unloadable) data sets without alteration; and ii) VAT, sVAT and coVAT are autonomous, parameter-free models: no "hidden values" are needed to make them work.
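
The pipeline described in the abstract (impute three square matrices from D, reorder the union matrix with VAT, then read off a rectangular image) can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation. The abstract does not state the imputation rule, so the sketch assumes one plausible choice: the dissimilarity between two row objects is the Euclidean distance between their rows of D (likewise for columns), rescaled so its mean matches the mean of D. The function names `vat_order`, `impute_square`, and `covat_image` are hypothetical, and the reordering is the commonly described Prim-like VAT ordering.

```python
import numpy as np


def vat_order(S):
    """Return a VAT-style ordering (a permutation) of a square dissimilarity matrix S."""
    k = S.shape[0]
    # Start from the row index of the largest dissimilarity.
    start = int(np.unravel_index(np.argmax(S), S.shape)[0])
    order = [start]
    remaining = set(range(k)) - {start}
    # min_dist[j] = smallest dissimilarity from j to any already-selected object.
    min_dist = S[start].copy()
    while remaining:
        rem = sorted(remaining)
        nxt = rem[int(np.argmin(min_dist[rem]))]
        order.append(nxt)
        remaining.remove(nxt)
        min_dist = np.minimum(min_dist, S[nxt])
    return np.array(order)


def _pairwise_rows(X):
    """Euclidean distance between every pair of rows of X."""
    return np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)


def _rescale(S, target_mean):
    """Scale S so its off-diagonal mean equals target_mean (assumed heuristic)."""
    k = S.shape[0]
    if k < 2:
        return S
    mean = S[~np.eye(k, dtype=bool)].mean()
    return S * (target_mean / mean) if mean > 0 else S


def impute_square(D):
    """Estimate Dr (m x m), Dc (n x n), and the union matrix (N x N) from D.

    ASSUMPTION: the missing blocks are imputed from row/column profiles of D;
    the paper's actual rule may differ.
    """
    D = np.asarray(D, dtype=float)
    Dr = _rescale(_pairwise_rows(D), D.mean())
    Dc = _rescale(_pairwise_rows(D.T), D.mean())
    # Known values of D sit in the off-diagonal blocks of the union matrix.
    Druc = np.block([[Dr, D], [D.T, Dc]])
    return Dr, Dc, Druc


def covat_image(D):
    """Reorder rows and columns of D using the VAT ordering of the union matrix."""
    D = np.asarray(D, dtype=float)
    m = D.shape[0]
    _, _, Druc = impute_square(D)
    order = vat_order(Druc)
    row_order = order[order < m]        # row objects, in VAT order
    col_order = order[order >= m] - m   # column objects, in VAT order
    return D[np.ix_(row_order, col_order)], row_order, col_order
```

Displaying the array returned by `covat_image` as a grayscale image (for example with an image-plotting routine) would then show dark rectangular blocks wherever a group of row objects is mutually close to a group of column objects. The square matrices `Dr` and `Dc` can be reordered with `vat_order` in the same way to look at problems P1 and P2 separately.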