Hierarchical Bayes variable selection and microarray experiments

  • Authors:
  • David J. Nott;Zeming Yu;Eva Chan;Chris Cotsapas;Mark J. Cowley;Jeremy Pulvers;Rohan Williams;Peter Little

  • Affiliations:
  • Department of Statistics, University of New South Wales, Sydney 2052, Australia and School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney 2052, Australia;Department of Statistics, University of New South Wales, Sydney 2052, Australia;School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney 2052, Australia;School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney 2052, Australia;School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney 2052, Australia;School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney 2052, Australia;School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney 2052, Australia;School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney 2052, Australia

  • Venue:
  • Journal of Multivariate Analysis
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Hierarchical and empirical Bayes approaches to inference are attractive for data arising from microarray gene expression studies because of their ability to borrow strength across genes in making inferences. Here we focus on the simplest case where we have data from replicated two colour arrays which compare two samples and where we wish to decide which genes are differentially expressed and obtain estimates of operating characteristics such as false discovery rates. The purpose of this paper is to examine the frequentist performance of Bayesian variable selection approaches to this problem for different prior specifications and to examine the effect on inference of commonly used empirical Bayes approximations to hierarchical Bayes procedures. The paper makes three main contributions. First, we describe how the log odds of differential expression can usually be computed analytically in the case where a double tailed exponential prior is used for gene effects rather than a normal prior, which gives an alternative to the commonly used B-statistic for ranking genes in simple comparative experiments. The second contribution of the paper is to compare empirical Bayes procedures for detecting differential expression with hierarchical Bayes methods which account for uncertainty in prior hyperparameters to examine how much is lost in using the commonly employed empirical Bayes approximations. Third, we describe an efficient MCMC scheme for carrying out the computations required for the hierarchical Bayes procedures. Comparisons are made via simulation studies where the simulated data are obtained by fitting models to some real microarray data sets. The results have implications for analysis of microarray data using parametric hierarchical and empirical Bayes methods for more complex experimental designs: generally we find that the empirical Bayes methods work well, which supports their use in the analysis of more complex experiments when a full hierarchical Bayes analysis would impose heavy computational demands.