Estimating bacterial diversity from environmental DNA: a maximum likelihood approach

  • Authors:
  • Frederick Cohan; Danny Krizanc; Yun Lu

  • Affiliations:
  • Department of Biology, Wesleyan University, Middletown, CT; Department of Mathematics and Computer Science, Wesleyan University, Middletown, CT; Department of Mathematics and Computer Science, Wesleyan University, Middletown, CT

  • Venue:
  • ISBRA'07 Proceedings of the 3rd international conference on Bioinformatics research and applications
  • Year:
  • 2007

Abstract

The ability to measure bacterial diversity is a prerequisite for the systematic study of bacterial biogeography and ecology. In this paper we describe a method for estimating diversity from an environmental DNA sample and apply it to data from the Sargasso Sea. Our approach combines the coverage depth method of Venter et al. [2] with the contig spectrum approach of Angly et al. [4], but recovers the diversity by maximum likelihood rather than by hand-fit models as in [2]. We assume four species abundance distributions, then maximize the likelihood of the coverage depth observed at different positions of the consensus sequence provided in the Sargasso Sea sample. The resulting estimates agree well with those obtained using less mathematically rigorous approaches.
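
The general idea of fitting coverage depth by maximum likelihood can be illustrated with a minimal sketch. It is not the model used in the paper: it assumes that every genome has the same size, that the depth at a position of species i is Poisson with mean proportional to that species' relative abundance, and that only positions with depth of at least 1 appear in the consensus. The two abundance families (`even`, `geometric`), the parameter `total_cov` (total sequenced bases divided by genome size), and all function names are hypothetical and chosen only for illustration.

```python
import math

def even(n):
    # Hypothetical abundance family: all n species equally abundant.
    return [1.0 / n] * n

def geometric(n, ratio=0.9):
    # Hypothetical abundance family: abundance decays geometrically with rank.
    weights = [ratio ** r for r in range(n)]
    total = sum(weights)
    return [w / total for w in weights]

def depth_pmf(k, abund, total_cov):
    # P(depth = k | depth >= 1) at a random assembled position, assuming
    # Poisson depth with mean total_cov * p_i for a species of abundance p_i.
    lams = [total_cov * p for p in abund]
    num = sum(math.exp(-l + k * math.log(l) - math.lgamma(k + 1)) for l in lams)
    den = sum(1.0 - math.exp(-l) for l in lams)
    return num / den

def log_likelihood(hist, abund, total_cov):
    # hist maps an observed depth k to the number of positions with that depth.
    return sum(c * math.log(depth_pmf(k, abund, total_cov)) for k, c in hist.items())

def fit_species_count(hist, candidates, total_cov, family=even):
    # Grid search: return the candidate species count with the highest likelihood.
    return max(candidates, key=lambda n: log_likelihood(hist, family(n), total_cov))

# Synthetic, noise-free depth histogram generated from 50 equally abundant species.
true_abund = even(50)
hist = {k: round(1e6 * depth_pmf(k, true_abund, 100.0)) for k in range(1, 30)}
hist = {k: c for k, c in hist.items() if c > 0}
best = fit_species_count(hist, [10, 20, 50, 100, 200], 100.0)
print(best)
```

Passing `family=geometric` repeats the same search under a different assumed abundance distribution; comparing the maximized likelihoods across families is one way, under these assumptions, to see how sensitive a diversity estimate is to the choice of distribution.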