A Comparison of Fuzzy Clustering Approaches for Quantification of Microarray Gene Expression

  • Authors:
  • Yu-Ping Wang;Maheswar Gunampally;Jie Chen;Douglas Bittel;Merlin G. Butler;Wei-Wen Cai

  • Affiliations:
  • School of Computing and Engineering, University of Missouri, Kansas City, USA 64110;School of Computing and Engineering, University of Missouri, Kansas City, USA 64110;Department of Mathematics and Statistics, Kansas City, USA 64110;Children's Mercy Hospital and Clinics, School of Medicine, University of Missouri, Kansas City, USA 64108;Children's Mercy Hospital and Clinics, School of Medicine, University of Missouri, Kansas City, USA 64108;Department of Human Molecular Genetics, Baylor College of Medicine, Houston, USA 77005

  • Venue:
  • Journal of Signal Processing Systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Despite the widespread application of microarray imaging for biomedical imaging research, barriers still exist regarding its reliability for clinical use. A critical major problem lies in accurate spot segmentation and the quantification of gene expression level (mRNA) from the microarray images. A variety of commercial and research freeware packages are available, but most cannot handle array spots with complex shapes such as donuts and scratches. Clustering approaches such as k-means and mixture models were introduced to overcome this difficulty, which use the hard labeling of each pixel. In this paper, we apply fuzzy clustering approaches for spot segmentation, which provides soft labeling of the pixel. We compare several fuzzy clustering approaches for microarray analysis and provide a comprehensive study of these approaches for spot segmentation. We show that possiblistic c-means clustering (PCM) provides the best performance in terms of stability criterion when testing on both a variety of simulated and real microarray images. In addition, we compared three statistical criteria in measuring gene expression levels and show that a new asymptotically unbiased statistic is able to quantify the gene expression level more accurately.