MC4: a tempering algorithm for large-sample network inference

Authors:
Daniel James Barker;Steven M. Hill;Sach Mukherjee
Affiliations:
Centre for Complexity Science, University of Warwick, Coventry, UK and Department of Physics, University of Warwick, Coventry, UK;Centre for Complexity Science, University of Warwick, Coventry, UK and Department of Statistics, University of Warwick, Coventry, UK;Department of Statistics, University of Warwick, Coventry, UK and Centre for Complexity Science, University of Warwick, Coventry, UK
Venue:
PRIB'10 Proceedings of the 5th IAPR international conference on Pattern recognition in bioinformatics
Year:
2010

Citing 5
Cited 0

Dynamic bayesian networks: representation, inference and learning

Dynamic bayesian networks: representation, inference and learning
Monte Carlo Statistical Methods (Springer Texts in Statistics)

Monte Carlo Statistical Methods (Springer Texts in Statistics)
Advances to Bayesian network inference for generating causal networks from observational biological data

Bioinformatics
A Recursive Method for Structural Learning of Directed Acyclic Graphs

The Journal of Machine Learning Research
Monte Carlo Strategies in Scientific Computing

Monte Carlo Strategies in Scientific Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Bayesian networks and their variants are widely used for modelling gene regulatory and protein signalling networks. In many settings, it is the underlying network structure itself that is the object of inference. Within a Bayesian framework inferences regarding network structure are made via a posterior probability distribution over graphs. However, in practical problems, the space of graphs is usually too large to permit exact inference, motivating the use of approximate approaches. An MCMC-based algorithm known as MC3 is widely used for network inference in this setting. We argue that recent trends towards larger sample size datasets, while otherwise advantageous, can, for reasons related to concentration of posterior mass, render inference by MC3 harder. We therefore exploit an approach known as parallel tempering to put forward an algorithm for network inference which we call MC4. We show empirical results on both synthetic and proteomic data which highlight the ability of MC4 to converge faster and thereby yield demonstrably accurate results, even in challenging settings where MC3 fails.