A Multilinear Singular Value Decomposition
SIAM Journal on Matrix Analysis and Applications
Clustering by pattern similarity in large data sets
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Discovering local structure in gene expression data: the order-preserving submatrix problem
Proceedings of the sixth annual international conference on Computational biology
Biclustering of Expression Data
Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
Mining coherent gene clusters from gene-sample-time microarray data
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Biclustering Algorithms for Biological Data Analysis: A Survey
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
TRICLUSTER: an effective algorithm for mining coherent clusters in 3D microarray data
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Mining Shifting-and-Scaling Co-Regulation Patterns on Gene Expression Profiles
ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Shifting and scaling patterns from gene expression data
Bioinformatics
A Generalized Maximum Entropy Approach to Bregman Co-clustering and Matrix Approximation
The Journal of Machine Learning Research
Hunting for Coherent Co-clusters in High Dimensional and Noisy Datasets
ICDMW '08 Proceedings of the 2008 IEEE International Conference on Data Mining Workshops
Unsupervised induction of modern standard Arabic verb classes
NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Tensor Decompositions and Applications
SIAM Review
Approximation algorithms for tensor clustering
ALT'09 Proceedings of the 20th international conference on Algorithmic learning theory
Bioinformatics
A new framework for co-clustering of gene expression data
PRIB'11 Proceedings of the 6th IAPR international conference on Pattern recognition in bioinformatics
Maximum Block Improvement and Polynomial Optimization
SIAM Journal on Optimization
Parameterized Complexity
Hi-index | 0.00 |
High-throughput techniques are producing large-scale high-dimensional (e.g., 4D with genes vs timepoints vs conditions vs tissues) genome-wide gene expression data. This induces increasing demands for effective methods for partitioning the data into biologically relevant groups. Current clustering and co-clustering approaches have limitations, which may be very time consuming and work for only low-dimensional expression datasets. In this work, we introduce a new notion of "co-identification", which allows systematical identification of genes participating different functional groups under different conditions or different development stages. The key contribution of our work is to build a unified computational framework of co-identification that enables clustering to be high-dimensional and adaptive. Our framework is based upon a generic optimization model and a general optimization method termed Maximum Block Improvement. Testing results on yeast and Arabidopsis expression data are presented to demonstrate high efficiency of our approach and its effectiveness.