Gene Expression Data Analysis Using a Novel Approach to Biclustering Combining Discrete and Continuous Data

Authors:
Yann Christinat;Bernd Wachmann;Lei Zhang
Affiliations:
-;-;-
Venue:
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Year:
2008

Citing 12
Cited 1

Discovering local structure in gene expression data: the order-preserving submatrix problem

Proceedings of the sixth annual international conference on Computational biology
Biclustering of Expression Data

Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
Enhanced Biclustering on Expression Data

BIBE '03 Proceedings of the 3rd IEEE Symposium on BioInformatics and BioEngineering
Learning from Order Examples

ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
Biclustering Algorithms for Biological Data Analysis: A Survey

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Evolutionary computation for biclustering of gene expression

Proceedings of the 2005 ACM symposium on Applied computing
Biclustering of Expression Data Using Simulated Annealing

CBMS '05 Proceedings of the 18th IEEE Symposium on Computer-Based Medical Systems
Discovering Coherent Biclusters from Gene Expression Data Using Zero-Suppressed Binary Decision Diagrams

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Multi-Class Biclustering and Classification Based on Modeling of Gene Regulatory Networks

BIBE '05 Proceedings of the Fifth IEEE Symposium on Bioinformatics and Bioengineering
BicAT: a biclustering analysis toolbox

Bioinformatics
A systematic comparison and evaluation of biclustering methods for gene expression data

Bioinformatics
Learning to order things

Journal of Artificial Intelligence Research

BiMine+: An efficient algorithm for discovering relevant biclusters of DNA microarray data

Knowledge-Based Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many different methods exist for pattern detection in gene expression data. In contrast to classical methods, biclustering has the ability to cluster a group of genes together with a group of conditions (replicates, set of patients or drug compounds). However, since the problem is NP-complex, most algorithms use heuristic search functions and therefore might converge towards local maxima. By using the results of biclustering on discrete data as a starting point for a local search function on continuous data, our algorithm avoids the problem of heuristic initialization. Similar to OPSM, our algorithm aims to detect biclusters whose rows and columns can be ordered such that row values are growing across the bicluster's columns and vice-versa. Results have been generated on the yeast genome (Saccharomyces cerevisiae), a human cancer dataset and random data. Results on the yeast genome showed that 89% of the one hundred biggest non-overlapping biclusters were enriched with Gene Ontology annotations. A comparison with OPSM and ISA demonstrated a better efficiency when using gene and condition orders. We present results on random and real datasets that show the ability of our algorithm to capture statistically significant and biologically relevant biclusters.