gTRICLUSTER: a more general and effective 3d clustering algorithm for gene-sample-time microarray data

  • Authors:
  • Haoliang Jiang;Shuigeng Zhou;Jihong Guan;Ying Zheng

  • Affiliations:
  • Department of Computer Science and Engineering, Fudan University;Department of Computer Science and Engineering, Fudan University;Dept. of Computer Sci. and Technol., Tongji University, Shanghai, China;Department of Computer Science and Engineering, Fudan University

  • Venue:
  • BioDM'06 Proceedings of the 2006 international conference on Data Mining for Biomedical Applications
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Clustering is an important technique in microarray data analysis, and mining three-dimensional (3D) clusters in gene-sample-time (simply GST) microarray data is emerging as a hot research topic in this area. A 3D cluster consists of a subset of genes that are coherent on a subset of samples along a segment of time series. This kind of coherent clusters may contain information for the users to identify useful phenotypes, potential genes related to these phenotypes and their expression rules. TRICLUSTER is the state-of-the-art 3D clustering algorithm for GST microarray data. In this paper, we propose a new algorithm to mine 3D clusters over GST microarray data. We term the new algorithm gTRICLUSTER because it is based on a more general 3D cluster model than the one that TRICLUSTER is based on. gTRICLUSTER can find more biologically meaningful coherent gene clusters than TRICLUSTER can do. It also outperforms TRICLUSTER in robustness to noise. Experimental results on a real-world microarray dataset validate the effectiveness of the proposed new algorithm.