Bayesian models for the analysis of genetic structure when populations are correlated

  • Authors:
  • Rongwei Fu;Dipak K. Dey;Kent E. Holsinger

  • Affiliations:
  • Department of Statistics U-4120 University of Connecticut Storrs, CT 06269-4120, USA;Department of Statistics U-4120 University of Connecticut Storrs, CT 06269-4120, USA;Department of Ecology and Evolutionary Biology U-3043 University of Connecticut Storrs, CT 06269-4120, USA

  • Venue:
  • Bioinformatics
  • Year:
  • 2005

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation: Population allele frequencies are correlated when populations have a shared history or when they exchange genes. Unfortunately, most models for allele frequency and inference about population structure ignore this correlation. Recent analytical results show that among populations, correlations can be very high, which could affect estimates of population genetic structure. In this study, we propose a mixture beta model to characterize the allele frequency distribution among populations. This formulation incorporates the correlation among populations as well as extending the model to data with different clusters of populations. Results: Using simulated data, we show that in general, the mixture model provides a good approximation of the among-population allele frequency distribution and a good estimate of correlation among populations. Results from fitting the mixture model to a dataset of genotypes at 377 autosomal microsatellite loci from human populations indicate high correlation among populations, which may not be appropriate to neglect. Traditional measures of population structure tend to overestimate the amount of genetic differentiation when correlation is neglected. Inference is performed in a Bayesian framework. Contact: fur@ohsu.edu