An iterative initial-points refinement algorithm for categorical data clustering
Pattern Recognition Letters
Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values
Data Mining and Knowledge Discovery
Unsupervised Learning with Mixed Numeric and Nominal Data
IEEE Transactions on Knowledge and Data Engineering
Feature Weighting in k-Means Clustering
Machine Learning
Fuzzy clustering of categorical data using fuzzy centroids
Pattern Recognition Letters
Automated Variable Weighting in k-Means Type Clustering
IEEE Transactions on Pattern Analysis and Machine Intelligence
On the Impact of Dissimilarity Measure in k-Modes Clustering Algorithm
IEEE Transactions on Pattern Analysis and Machine Intelligence
A genetic fuzzy k-Modes algorithm for clustering categorical data
Expert Systems with Applications: An International Journal
Electricity based external similarity of categorical attributes
PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Association-Based dissimilarity measures for categorical data: limitation and improvement
PAKDD'06 Proceedings of the 10th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
Improving k-modes algorithm considering frequencies of attribute values in mode
CIS'05 Proceedings of the 2005 international conference on Computational Intelligence and Security - Volume Part I
A fuzzy k-modes algorithm for clustering categorical data
IEEE Transactions on Fuzzy Systems
Rough set based fuzzy k-modes for categorical data
SEMCCO'12 Proceedings of the Third international conference on Swarm, Evolutionary, and Memetic Computing
Hi-index | 12.05 |
In this paper, we generalize the k-modes clustering algorithm by weighting attribute value in the dissimilarity computation. Such a generalization generates clusters with stronger intra-similarities, leading to better clustering performance. Experimental results on real life data show that the new k-modes algorithm is superior to the standard k-modes algorithm with respect to clustering accuracy.