Hierarchical Clustering (HC) is not designed to position the leaf nodes within its tree structure, and is therefore not suited to revealing similarity relations along the sequence of leaf nodes. To expose these similarity relations in the tree diagram of HC, we propose an improved solution in this paper: the Referential Hierarchical clustering Algorithm (RHA). RHA combines HC, a Genetic Algorithm (GA), and Principal Component Analysis (PCA) to address this limitation of traditional HC. PCA is a technique that reduces a high-dimensional dataset to fewer dimensions for analysis and reconstructs each data point as a suitable linear combination of the principal components. These principal components are ordered by the amount of variance in the original dataset that they explain. RHA therefore uses the GA to find the solution that has the same tree structure as HC and whose leaf sequence is most similar to the sequence of samples sorted by increasing value of the first principal component. Experimental results show that the clustering result of RHA exposes the similarity relations between the leaf nodes and the clusters. RHA can be applied to any problem that uses HC, and the generated tree diagram can help researchers compare and analyze individual samples and find the relations between clusters more easily and quickly.
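The search described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the chromosome encoding, the fitness function, or the GA settings, so everything below is an assumption made for illustration. The sketch assumes that a fixed binary tree is reordered only by swapping the two children of internal nodes (which leaves the tree structure unchanged), encodes each candidate as one flip bit per internal node, and scores a leaf sequence by the number of sample pairs that appear in increasing order of their first-principal-component score. The tree, the sample names, and the `pc1` values are made up.

```python
import random

# Hypothetical dendrogram as nested pairs; leaves are sample names.
tree = ((("a", "b"), "c"), (("d", "e"), ("f", "g")))

# Hypothetical first-principal-component score of each sample.
pc1 = {"a": 0.9, "b": 0.4, "c": -0.2, "d": 2.1, "e": 1.5, "f": -1.3, "g": -0.7}


def count_internal(node):
    """Number of internal (non-leaf) nodes in the tree."""
    if isinstance(node, str):
        return 0
    return 1 + count_internal(node[0]) + count_internal(node[1])


def leaf_order(node, flips, pos=None):
    """Left-to-right leaf sequence; children are swapped where the flip bit is 1.

    Internal nodes are numbered in preorder, so flips[k] belongs to the
    k-th internal node visited.
    """
    if pos is None:
        pos = [0]
    if isinstance(node, str):
        return [node]
    bit = flips[pos[0]]
    pos[0] += 1
    left = leaf_order(node[0], flips, pos)
    right = leaf_order(node[1], flips, pos)
    return right + left if bit else left + right


def concordant_pairs(order, scores):
    """Assumed fitness: pairs of leaves that appear in increasing score order."""
    return sum(
        scores[order[i]] < scores[order[j]]
        for i in range(len(order))
        for j in range(i + 1, len(order))
    )


def evolve(tree, scores, pop_size=20, generations=40, seed=0):
    """Small GA over flip bits with one-point crossover, mutation, and elitism."""
    rng = random.Random(seed)
    n = count_internal(tree)

    def fit(flips):
        return concordant_pairs(leaf_order(tree, flips), scores)

    # Start from the unflipped tree plus random flip patterns.
    pop = [[0] * n] + [
        [rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size - 1)
    ]
    for _ in range(generations):
        pop.sort(key=fit, reverse=True)
        parents = pop[: pop_size // 2]
        children = [parents[0][:]]  # elitism: keep the best candidate so far
        while len(children) < pop_size:
            p, q = rng.sample(parents, 2)
            cut = rng.randrange(1, n)
            child = p[:cut] + q[cut:]
            if rng.random() < 0.2:
                child[rng.randrange(n)] ^= 1  # flip one random bit
            children.append(child)
        pop = children
    best = max(pop, key=fit)
    return leaf_order(tree, best), fit(best)


order, score = evolve(tree, pc1)
print(order, score)
```

Because only child swaps are allowed, every candidate keeps the same clusters as the input tree; the GA changes only the left-to-right arrangement of the leaves. For a tree this small the 64 flip patterns could simply be enumerated, so the GA here is purely illustrative of how the search might scale to larger trees.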