Biclustering of Expression Data Using Simulated Annealing

  • Authors:
  • Kenneth Bryan

  • Affiliations:
  • Trinity College Dublin

  • Venue:
  • CBMS '05 Proceedings of the 18th IEEE Symposium on Computer-Based Medical Systems
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In a gene expression data matrix a bicluster is a grouping of a subset of genes and a subset of conditions which show correlating levels of expression activity. The difficulty of finding significant biclusters in gene expression data grows exponentially with the size of the dataset and heuristic approaches such as Cheng and Churchýs greedy node deletion algorithm are required. It is to be expected that stochastic search techniques such as Genetic Algorithms or Simulated Annealing might produce better solutions than greedy search. In this paper we show that a Simulated Annealing approach is well suited to this problem and we present a comparative evaluation of Simulated Annealing and node deletion on a variety of datasets. We show that Simulated Annealing discovers more significant biclusters in many cases.