LOGOS: a modular Bayesian model for de novo motif detection

  • Authors:
  • Eric P. Xing;Wei Wu;Michael I. Jordan;Richard M. Karp

  • Affiliations:
  • -;-;-;-

  • Venue:
  • CSB '03 Proceedings of the IEEE Computer Society Conference on Bioinformatics
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

The complexity of the global organization and internalstructures of motifs in higher eukaryotic organisms raisessignificant challenges for motif detection techniques. Toachieve successful de novo motif detection it is necessary tomodel the complex dependencies within and among motifsand incorporate biological prior knowledge. In this paper,we present LOGOS, an integrated LOcal and GlObal motifSequence model for biopolymer sequences, which providesa principled framework for developing, modularizing,extending and computing expressive motif models forcomplex biopolymer sequence analysis. LOGOS consistsof two interacting submodels: HMDM, a local alignmentmodel capturing biological prior knowledge and positionaldependence within the motif local structure; and HMM, aglobal motif distribution model modeling frequencies anddependencies of motif occurrences. Model parameters canbe fit using training motifs within an empirical Bayesianframework. A variational EM algorithm is developed for denovo motif detection. LOGOS improves over existing modelsthat ignore biological priors and dependencies in motifstructures and motif occurrences, and demonstrates superiorperformance on both semi-realistic test data and cis-regulatorysequences from yeast and Drosophila sequenceswith regard to sensitivity, specificity, flexibility and extensibility.