Accurate confidence aware clustering of array CGH tumor profiles

  • Authors:
  • Bart P.P. van Houte;Jaap Heringa

  • Affiliations:
  • -;-

  • Venue:
  • Bioinformatics
  • Year:
  • 2010

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation: Chromosomal aberrations tend to be characteristic for given (sub)types of cancer. Such aberrations can be detected with array comparative genomic hybridization (aCGH). Clustering aCGH tumor profiles aids in identifying chromosomal regions of interest and provides useful diagnostic information on the cancer type. An important issue here is to what extent individual aCGH tumor profiles can be reliably assigned to clusters associated with a given cancer type. Results: We introduce a novel evolutionary fuzzy clustering (EFC) algorithm, which is able to deal with overlapping clusterings. Our method assesses these overlaps by using cluster membership degrees, which we use here as a confidence measure for individual samples to be assigned to a given tumor type. We first demonstrate the usefulness of our method using a synthetic aCGH dataset and subsequently show that EFC outperforms existing methods on four real datasets of aCGH tumor profiles involving four different cancer types. We also show that in general best performance is obtained using 1-Pearson correlation coefficient as a distance measure and that extra preprocessing steps, such as segmentation and calling, lead to decreased clustering performance. Availability: The source code of the program is available from http://ibi.vu.nl/programs/efcwww Contact: heringa@few.vu.nl Supplementary information:Supplementary data are available at Bioinformatics online.