GraphClus, a MATLAB program for cluster analysis using graph theory

  • Authors:
  • Clifford S. Todd;Tivadar M Toth;Róbert Busa-Fekete

  • Affiliations:
  • Department of Geology & Geophysics, University of Hawaii, 1680 East-West Road, Honolulu, HI 96822, USA;Department of Mineralogy, Geochemistry and Petrology, University of Szeged, P.O. Box 651, H-6721 Szeged, Hungary;Research Group on Artificial Intelligence, University of Szeged, P.O. Box 652, H-6721 Szeged, Hungary

  • Venue:
  • Computers & Geosciences
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Cluster analysis is used in numerous scientific disciplines. A method of cluster analysis based on graph theory is discussed and a MATLAB(TM) code for its implementation is presented. The algorithm is based on the number of variables that are similar between samples. By changing the similarity criterion in a stepwise fashion, a hierarchical group structure develops, and can be displayed by a dendrogram. Three indexes describe the homogeneity of a given variable in a group, the heterogeneity of that variable between two groups, and the usefulness of that variable in distinguishing two groups. The algorithm is applied to both a synthetic dataset and a set of trace element analyses of lavas from Mount Etna in order to compare GraphClus to other cluster analysis algorithms.