Linear Coherent Bi-cluster Discovery via Line Detection and Sample Majority Voting

Authors:
Yi Shi;Zhipeng Cai;Guohui Lin;Dale Schuurmans
Affiliations:
Department of Computing Science, University of Alberta, Edmonton, T6G 2E8;Department of Computing Science, University of Alberta, Edmonton, T6G 2E8;Department of Computing Science, University of Alberta, Edmonton, T6G 2E8;Department of Computing Science, University of Alberta, Edmonton, T6G 2E8
Venue:
COCOA '09 Proceedings of the 3rd International Conference on Combinatorial Optimization and Applications
Year:
2009

Citing 6
Cited 1

Discovering local structure in gene expression data: the order-preserving submatrix problem

Proceedings of the sixth annual international conference on Computational biology
Biclustering of Expression Data

Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
Biclustering Algorithms for Biological Data Analysis: A Survey

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Defining transcription modules using large-scale gene expression data

Bioinformatics
A systematic comparison and evaluation of biclustering methods for gene expression data

Bioinformatics
Computing the maximum similarity bi-clusters of gene expression data

Bioinformatics

Sparse learning based linear coherent bi-clustering

WABI'12 Proceedings of the 12th international conference on Algorithms in Bioinformatics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Discovering groups of genes that share common expression profiles is an important problem in DNA microarray analysis. Unfortunately, standard bi-clustering algorithms often fail to retrieve common expression groups because (1) genes only exhibit similar behaviors over a subset of conditions, and (2) genes may participate in more than one functional process and therefore belong to multiple groups. Many algorithms have been proposed to address these problems in the past decade; however, in addition to the above challenges most such algorithms are unable to discover linear coherent bi-clusters--a strict generalization of additive and multiplicative bi-clustering models. In this paper, we propose a novel bi-clustering algorithm that discovers linear coherent bi-clusters, based on first detecting linear correlations between pairs of gene expression profiles, then identifying groups by sample majority voting. Our experimental results on both synthetic and two real datasets, Saccharomyces cerevisiae and Arabidopsis thaliana , show significant performance improvements over previous methods. One intriguing aspect of our approach is that it can easily be extended to identify bi-clusters of more complex gene-gene correlations.