A multiple-feature framework for modelling and predicting transcription factor binding sites

  • Authors:
  • Rainer Pudimat;Ernst-Günter Schukat-Talamazzini;Rolf Backofen

  • Affiliations:
  • Institut für Informatik, Friedrich-Schiller-Universität, Ernst-Abbe-Platz 3, D-07743 Jena, Germany;Institut für Informatik, Friedrich-Schiller-Universität, Ernst-Abbe-Platz 3, D-07743 Jena, Germany;Institut für Informatik, Friedrich-Schiller-Universität, Ernst-Abbe-Platz 3, D-07743 Jena, Germany

  • Venue:
  • Bioinformatics
  • Year:
  • 2005

Abstract

Motivation: Identifying transcription factor binding sites in promoter sequences is an important problem because it reveals information about the transcriptional regulation of genes. To analyse transcriptional regulation, computational approaches are applied to predict putative binding sites. Commonly used stochastic models for binding sites are position-specific score matrices, which show weak predictive power.

Results: We have developed a probabilistic modelling approach that allows diverse characteristic properties of binding sites to be considered in order to obtain more accurate representations of binding sites. These properties are modelled as random variables in Bayesian networks, which are capable of dealing with dependencies among binding site properties. Cross-validation on several datasets shows improvements in the false positive error rate and the significance (P-value) of true binding sites.

Supplementary information: A more extensive description of the validation results is available at http://www.bio.inf.uni-jena.de/Software/promapper/

Contact: backofen@inf.uni-jena.de
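
For readers unfamiliar with the baseline named in the abstract, the following is a minimal sketch of how a position-specific score matrix can be built and used to scan a sequence. It is not the authors' method or software: the example sites, the uniform background, the pseudocount, and all function names (`build_pssm`, `score_window`, `scan`, `best_hit`) are assumptions chosen for illustration only.

```python
import math

# Hypothetical toy alignment of known binding sites (not data from the paper).
SITES = ["TATAAT", "TATGAT", "TACAAT", "TATATT"]
ALPHABET = "ACGT"
BACKGROUND = {base: 0.25 for base in ALPHABET}  # assumed uniform background
PSEUDOCOUNT = 1.0  # assumed Laplace smoothing to avoid log(0)


def build_pssm(sites):
    """Return one dict per position mapping base -> log2-odds score."""
    length = len(sites[0])
    total = len(sites) + PSEUDOCOUNT * len(ALPHABET)
    pssm = []
    for pos in range(length):
        column = [site[pos] for site in sites]
        scores = {}
        for base in ALPHABET:
            freq = (column.count(base) + PSEUDOCOUNT) / total
            scores[base] = math.log2(freq / BACKGROUND[base])
        pssm.append(scores)
    return pssm


def score_window(pssm, window):
    """Sum the per-position scores; positions are treated as independent."""
    return sum(col[base] for col, base in zip(pssm, window))


def scan(pssm, sequence):
    """Score every window of the matrix width along the sequence."""
    width = len(pssm)
    return [(i, score_window(pssm, sequence[i:i + width]))
            for i in range(len(sequence) - width + 1)]


def best_hit(pssm, sequence):
    """Return (start index, score) of the highest-scoring window."""
    return max(scan(pssm, sequence), key=lambda hit: hit[1])
```

The additive score treats each position independently of the others. That simplification is one plausible reason for the limited predictive power noted in the Motivation, and it is the kind of restriction that a model able to represent dependencies, such as the Bayesian networks described in the Results, is designed to relax.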