CURE: an efficient clustering algorithm for large databases
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs
SIAM Journal on Scientific Computing
ROCK: A Robust Clustering Algorithm for Categorical Attributes
ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Hi-index | 0.00 |
Recent developments in information technology have enabled us to deal with large and complex similarity data. Researchers often need to know the cluster structures of such a datasets before constructing inferential models or other such interrogation techniques. To reveal cluster structures, Chamelleon (Karypis et al., 1999) can make subclusters from datasets using graph partition methods and apply hierarchical clustering to reduce the amount of calculations and to reflect the structures in the clusters. Chamelleon can capture arbitrary shaped clusters from similarity data. It can consider intrasimilarities and intersimilarities when two clusters are combined. In addition, the method is robust for outliers whose objects are far from other objects in the same subcluster. However, it cannot detect the cluster structures that cannot be detected by the group average method. This paper proposes a new hierarchical clustering method based on the single linkage method for use when similarity data and subclusters are given. The proposed method has three advantages. First, it considers the intrasimilarities and intersimilarities of some parts in subclusters. Second, it considers the effects of outliers and cluster sizes. Finally, it detects arbitrary shaped cluster structures that cannot be detected by Chamelleon.