Identifying sub-populations via unsupervised cluster analysis on multi-edge similarity graphs

  • Authors:
  • Madhura Ingalhalikar;Alex R. Smith;Luke Bloy;Ruben Gur;Timothy P. L. Roberts;Ragini Verma

  • Affiliations:
  • Section of Biomedical Image Analysis, University of Pennsylvania, Philadelphia, PA;Section of Biomedical Image Analysis, University of Pennsylvania, Philadelphia, PA;Section of Biomedical Image Analysis, University of Pennsylvania, Philadelphia, PA;Brain Behavior Laboratory, University of Pennsylvania, Philadelphia, PA;Lurie Family Foundation's MEG Imaging Center, Department of Radiology, Children's Hospital of Philadelphia, Philadelphia, PA;Section of Biomedical Image Analysis, University of Pennsylvania, Philadelphia, PA

  • Venue:
  • MICCAI'12 Proceedings of the 15th international conference on Medical Image Computing and Computer-Assisted Intervention - Volume Part II
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Pathologies like autism and schizophrenia are a broad set of disorders with multiple etiologies in the same diagnostic category. This paper presents a method for unsupervised cluster analysis using multi-edge similarity graphs that combine information from different modalities. The method alleviates the issues with traditional supervised classification methods that use diagnostic labels and are therefore unable to exploit or elucidate the underlying heterogeneity of the dataset under analysis. The framework introduced in this paper has the ability to employ diverse features that define different aspects of pathology obtained from different modalities to create a multi-edged graph on which clustering is performed. The weights on the multiple edges are optimized using a novel concept of 'holding power' that describes the certainty with which a subject belongs to a cluster. We apply the technique to two separate clinical populations of autism spectrum disorder (ASD) and schizophrenia (SCZ), where the multi-edged graph for each population is created by combining information from structural networks and cognitive scores. For the ASD-control population the method clusters the data into two classes and the SCZ-control population is clustered into four. The two classes in ASD agree with underlying diagnostic labels with 92% accuracy and the SCZ clustering agrees with 78% accuracy, indicating a greater heterogeneity in the SCZ population.