Ontology-Driven Co-clustering of Gene Expression Data

Authors:
Francesca Cordero;Ruggero G. Pensa;Alessia Visconti;Dino Ienco;Marco Botta
Affiliations:
Department of Clinical and Biological Sciences, University of Torino, and Department of Computer Science, University of Torino, and Center for Complex Systems in Molecular Biology and Medicine - S ...;Department of Computer Science, University of Torino,;Department of Computer Science, University of Torino, and Center for Complex Systems in Molecular Biology and Medicine - SysBioM, University of Torino,;Department of Computer Science, University of Torino, and Center for Complex Systems in Molecular Biology and Medicine - SysBioM, University of Torino,;Department of Computer Science, University of Torino, and Center for Complex Systems in Molecular Biology and Medicine - SysBioM, University of Torino,
Venue:
AI*IA '09: Proceedings of the XIth International Conference of the Italian Association for Artificial Intelligence Reggio Emilia on Emergent Perspectives in Artificial Intelligence
Year:
2009

Citing 7
Cited 0

Biclustering of Expression Data

Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology
Cluster ensembles --- a knowledge reuse framework for combining multiple partitions

The Journal of Machine Learning Research
Biclustering Algorithms for Biological Data Analysis: A Survey

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Determining the Number of Clusters/Segments in Hierarchical Clustering/Segmentation Algorithms

ICTAI '04 Proceedings of the 16th IEEE International Conference on Tools with Artificial Intelligence
Hypothesis-driven approach to predict transcriptional units from gene expression data

Bioinformatics
Co-clustering and visualization of gene expression data and gene ontology terms for Saccharomyces cerevisiae using self-organizing maps

Journal of Biomedical Informatics
A new protein motif extraction framework based on constrained co-clustering

Proceedings of the 2009 ACM symposium on Applied Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

The huge volume of gene expression data produced by microarrays and other high-throughput techniques has encouraged the development of new computational techniques to evaluate the data and to formulate new biological hypotheses. To this purpose, co-clustering techniques are widely used: these identify groups of genes that show similar activity patterns under a specific subset of the experimental conditions by measuring the similarity in expression within these groups. However, in many applications, distance metrics based only on expression levels fail in capturing biologically meaningful clusters. We propose a methodology in which a standard expression-based co-clustering algorithm is enhanced by sets of constraints which take into account the similarity/dissimilarity (inferred by the Gene Ontology, GO) between pairs of genes. Our approach minimizes the intervention of the analyst within the co-clustering process. It provides meaningful co-clusters whose discovery and interpretation is increased by embedding GO annotations.