An association-based dissimilarity measure for categorical data
Pattern Recognition Letters
Attribute value weighting in k-modes clustering
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
Measuring the similarity for categorical data is a challenging task in data mining due to the poor structure of categorical data. This paper presents a dissimilarity measure for categorical data based on the relations among attributes. This measure not only has the advantage of value variance but also overcomes the limitations of condition the probability-based measure when applied to databases whose attributes are independent. Experiments with 30 databases also showed that the proposed measure boosted the accuracy of Nearest Neighbor classification in comparison with other tested measures.