Gene Networks Viewed through Two Models
BICoB '09 Proceedings of the 1st International Conference on Bioinformatics and Computational Biology
Bayesian Learning in Sparse Graphical Factor Models via Variational Mean-Field Annealing
The Journal of Machine Learning Research
Proceedings of the 2nd ACM Conference on Bioinformatics, Computational Biology and Biomedicine
Reverse engineering of gene regulatory networks from biological data
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Hi-index | 3.85 |
Motivation: Statistical inference of gene networks by using time-course microarray gene expression profiles is an essential step towards understanding the temporal structure of gene regulatory mechanisms. Unfortunately, most of the current studies have been limited to analysing a small number of genes because the length of time-course gene expression profiles is fairly short. One promising approach to overcome such a limitation is to infer gene networks by exploring the potential transcriptional modules which are sets of genes sharing a common function or involved in the same pathway. Results: In this article, we present a novel approach based on the state space model to identify the transcriptional modules and module-based gene networks simultaneously. The state space model has the potential to infer large-scale gene networks, e.g. of order 103, from time-course gene expression profiles. Particularly, we succeeded in the identification of a cell cycle system by using the gene expression profiles of Saccharomyces cerevisiae in which the length of the time-course and number of genes were 24 and 4382, respectively. However, when analysing shorter time-course data, e.g. of length 10 or less, the parameter estimations of the state space model often fail due to overfitting. To extend the applicability of the state space model, we provide an approach to use the technical replicates of gene expression profiles, which are often measured in duplicate or triplicate. The use of technical replicates is important for achieving highly-efficient inferences of gene networks with short time-course data. The potential of the proposed method has been demonstrated through the time-course analysis of the gene expression profiles of human umbilical vein endothelial cells (HUVECs) undergoing growth factor deprivation-induced apoptosis. Availability: Supplementary Information and the software (TRANS-MNET) are available at http://daweb.ism.ac.jp/~yoshidar/software/ssm/ Contact: yoshidar@ism.ac.jp Supplementary information: Supplementary data are available at Bioinformatics online.