Applications of beta-mixture models in bioinformatics

  • Authors:
  • Yuan Ji;Chunlei Wu;Ping Liu;Jing Wang;Kevin R. Coombes

  • Affiliations:
  • Department of Biostatistics, The University of Texas M.D. Anderson Cancer Center Houston, TX 77030, USA;3M Pharmaceuticals St Paul, Minnesota, USA;Department of Biostatistics, The University of Texas M.D. Anderson Cancer Center Houston, TX 77030, USA;Department of Biostatistics, The University of Texas M.D. Anderson Cancer Center Houston, TX 77030, USA;Department of Biostatistics, The University of Texas M.D. Anderson Cancer Center Houston, TX 77030, USA

  • Venue:
  • Bioinformatics
  • Year:
  • 2005

Quantified Score

Hi-index 3.84

Visualization

Abstract

Summary: We propose a beta-mixture model approach to solve a variety of problems related to correlations of gene-expression levels. For example, in meta-analyses of microarray gene-expression datasets, a threshold value of correlation coefficients for gene-expression levels is used to decide whether gene-expression levels are strongly correlated across studies. Ad hoc threshold values such as 0.5 are often used. In this paper, we use a beta-mixture model approach to divide the correlation coefficients into several populations so that the large correlation coefficients can be identified. Another important application of the proposed method is in finding co-expressed genes. Two examples are provided to illustrate both applications. Through our analysis, we also discover that the popular model selection criteria BIC and AIC are not suitable for the beta-mixture model. To determine the number of components in the mixture model, we suggest an alternative criterion, ICL--BIC, which is shown to perform better in selecting the correct mixture model. Contact: yuanji@mdanderson.org Supplementary information: http://odin.mdacc.tmc.edu/~yuanj/highcorgeneanno.html