A unified adaptive co-identification framework for high-d expression data

Authors:
Shuzhong Zhang;Kun Wang;Cody Ashby;Bilian Chen;Xiuzhen Huang
Affiliations:
University of Minnesota, Minneapolis, MN;Arkansas State University, Jonesboro, AR;Arkansas State University, Jonesboro, AR;Xiamen University, Xiamen, China;Arkansas State University, Jonesboro, AR
Venue:
PRIB'12 Proceedings of the 7th IAPR international conference on Pattern Recognition in Bioinformatics
Year:
2012

Citing 19
Cited 0

A Multilinear Singular Value Decomposition

SIAM Journal on Matrix Analysis and Applications
Clustering by pattern similarity in large data sets

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Discovering local structure in gene expression data: the order-preserving submatrix problem

Proceedings of the sixth annual international conference on Computational biology
Biclustering of Expression Data

Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
Mining coherent gene clusters from gene-sample-time microarray data

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Biclustering Algorithms for Biological Data Analysis: A Survey

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
TRICLUSTER: an effective algorithm for mining coherent clusters in 3D microarray data

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Mining Shifting-and-Scaling Co-Regulation Patterns on Gene Expression Profiles

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Shifting and scaling patterns from gene expression data

Bioinformatics
A systematic comparison and evaluation of biclustering methods for gene expression data

Bioinformatics
A Generalized Maximum Entropy Approach to Bregman Co-clustering and Matrix Approximation

The Journal of Machine Learning Research
Hunting for Coherent Co-clusters in High Dimensional and Noisy Datasets

ICDMW '08 Proceedings of the 2008 IEEE International Conference on Data Mining Workshops
Unsupervised induction of modern standard Arabic verb classes

NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Tensor Decompositions and Applications

SIAM Review
Approximation algorithms for tensor clustering

ALT'09 Proceedings of the 20th international conference on Algorithmic learning theory
FABIA

Bioinformatics
A new framework for co-clustering of gene expression data

PRIB'11 Proceedings of the 6th IAPR international conference on Pattern recognition in bioinformatics
Maximum Block Improvement and Polynomial Optimization

SIAM Journal on Optimization
Parameterized Complexity

Parameterized Complexity

Quantified Score

Hi-index	0.00

Visualization

Abstract

High-throughput techniques are producing large-scale high-dimensional (e.g., 4D with genes vs timepoints vs conditions vs tissues) genome-wide gene expression data. This induces increasing demands for effective methods for partitioning the data into biologically relevant groups. Current clustering and co-clustering approaches have limitations, which may be very time consuming and work for only low-dimensional expression datasets. In this work, we introduce a new notion of "co-identification", which allows systematical identification of genes participating different functional groups under different conditions or different development stages. The key contribution of our work is to build a unified computational framework of co-identification that enables clustering to be high-dimensional and adaptive. Our framework is based upon a generic optimization model and a general optimization method termed Maximum Block Improvement. Testing results on yeast and Arabidopsis expression data are presented to demonstrate high efficiency of our approach and its effectiveness.