Heat Map Visualizations Allow Comparison of Multiple Clustering Results and Evaluation of Dataset Quality: Application to Microarray Data

  • Authors:
  • John Sharko;Georges G. Grinstein;Kenneth A. Marx;Jianping Zhou;Chia-Ho Cheng;Shannon Odelberg;Hans-Georg Simon

  • Affiliations:
  • University of Massachusetts Lowell;University of Massachusetts Lowell;University of Massachusetts Lowell;University of Massachusetts Lowell;University of Massachusetts Lowell;University of Utah;Northwestern University, Chicago, IL.

  • Venue:
  • IV '07 Proceedings of the 11th International Conference Information Visualization
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Since clustering algorithms are heuristic, multiple clustering algorithms applied to the same dataset will typically not generate the same sets of clusters. This is especially true for complex datasets such as those from microarray time series experiments. Two such microarray datasets describing gene expression activities from regenerating newt forelimbs at various times following limb amputation were used in this study. A cluster stability matrix, which shows the number of times two genes appear in the same cluster, was generated as a heat map. This was used to evaluate the overall variation among the clustering algorithms and to identify similar clusters. A comparison of the cluster stability matrices for two related microarray experiments with different levels of precision was shown to be an effective basis for comparing the quality of the two sets of experiments. A pairwise heat map was generated to show which pairs of clustering algorithms grouped the data into similar clusters.