Application of Simulated Annealing to the Biclustering of Gene Expression Data

  • Authors:
  • K. Bryan;P. Cunningham;N. Bolshakova

  • Affiliations:
  • Machine Learning Group, Trinity Coll. Dublin;-;-

  • Venue:
  • IEEE Transactions on Information Technology in Biomedicine
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In a gene expression data matrix, a bicluster is a submatrix of genes and conditions that exhibits a high correlation of expression activity across both rows and columns. The problem of locating the most significant bicluster has been shown to be NP-complete. Heuristic approaches such as Cheng and Church's greedy node deletion algorithm have been previously employed. It is to be expected that stochastic search techniques such as evolutionary algorithms or simulated annealing might improve upon such greedy techniques. In this paper we show that an approach based on simulated annealing is well suited to this problem, and we present a comparative evaluation of simulated annealing and node deletion on a variety of datasets. We show that simulated annealing discovers more significant biclusters in many cases. Furthermore, we also test the ability of our technique to locate biologically verifiable biclusters within an annotated set of genes