Mean squared residue based biclustering algorithms

  • Authors:
  • Stefan Gremalschi;Gulsah Altun

  • Affiliations:
  • Department of Computer Science, Georgia State University, Atlanta, GA;Department of Computer Science, Georgia State University, Atlanta, GA

  • Venue:
  • ISBRA'08 Proceedings of the 4th international conference on Bioinformatics research and applications
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

The availability of large microarray data has brought along many challengesfor biological data mining. Following Cheng and Church [4], many differentbiclustering methods have been widely used to find appropriate subsets ofexperimental conditions. Still no paper directly optimizes or bounds the MeanSquared Residue (MSR) originally suggested by Cheng and Church. Their algorithm,for a given expression matrix A and an upper bound on MSR, finds kalmost non overlapping biclusters whose sizes are not predefined thus making itdifficult to compare with other methods. In this paper, we propose two new Mean Squared Residue (MSR) based biclusteringmethods. The first method is a dual biclustering algorithm which finds(k × l)-bicluster with MSR using a greedy approach. The second method combinesdual biclustering algorithm with quadratic programming. The dual biclusteringalgorithm reduces the size of the matrix, so that the quadratic programcan find an optimal bicluster reasonably fast. We control bicluster overlappingby changing the penalty for reusing cells in biclusters. The average MSR in [4]biclusterings for yeast is almost the same as for the proposed dual biclusteringwhile the median MSR is 1.5 times larger thus implying that the quadratic programfinds much better smaller biclusters.