Integrating genomic binding site predictions using real-valued meta classifiers

  • Authors:
  • Yi Sun;Mark Robinson;Rod Adams;Rene te Boekhorst;Alistair G. Rust;Neil Davey

  • Affiliations:
  • University of Hertfordshire, Science and Technology Research Institute, College Lane, Hatfield, AL10 9AB, Hertfordshire, UK;Michigan State University, Department of Biochemistry and Molecular Biology, 48824, East Lansing, MI, USA;University of Hertfordshire, Science and Technology Research Institute, College Lane, Hatfield, AL10 9AB, Hertfordshire, UK;University of Hertfordshire, Science and Technology Research Institute, College Lane, Hatfield, AL10 9AB, Hertfordshire, UK;Institute for Systems Biology, 1441 North 34th Street, 981-3, Seattle, WA, USA;University of Hertfordshire, Science and Technology Research Institute, College Lane, Hatfield, AL10 9AB, Hertfordshire, UK

  • Venue:
  • Neural Computing and Applications
  • Year:
  • 2009

Abstract

Currently, the best algorithms for predicting transcription factor binding sites in DNA sequences are severely limited in accuracy. There is good reason to believe that predictions from different classes of algorithms could be combined to improve prediction quality. In this paper, we apply single-layer networks, rule sets, support vector machines and the AdaBoost algorithm to predictions from 12 key real-valued algorithms. Furthermore, we use a ‘window’ of consecutive results as the input vector, so that each prediction is classified in the context of its neighbours. We improve the classification result further with under- and over-sampling techniques. We find that support vector machines and the AdaBoost algorithm outperform both the original individual algorithms and the other classifiers employed in this work. In particular, they give a better tradeoff between recall and precision.
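
The windowing and over-sampling steps mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the zero padding at the sequence ends, the window half-width, and the use of plain random duplication for over-sampling are all assumptions made for illustration only.

```python
import random
from typing import List, Sequence, Tuple


def window_features(scores: Sequence[Sequence[float]],
                    half_width: int,
                    pad: float = 0.0) -> List[List[float]]:
    """Build one input vector per sequence position.

    scores[i] holds the real-valued predictions of every base algorithm
    at position i. The vector for position i concatenates the predictions
    for positions i - half_width .. i + half_width. Positions outside the
    sequence are filled with `pad` (an assumption, not from the paper).
    """
    if half_width < 0:
        raise ValueError("half_width must be non-negative")
    n = len(scores)
    if n == 0:
        return []
    padding = [pad] * len(scores[0])
    features = []
    for i in range(n):
        row: List[float] = []
        for j in range(i - half_width, i + half_width + 1):
            row.extend(scores[j] if 0 <= j < n else padding)
        features.append(row)
    return features


def oversample_minority(features: Sequence[Sequence[float]],
                        labels: Sequence[int],
                        seed: int = 0) -> Tuple[list, list]:
    """Balance two classes by duplicating random minority examples.

    This is simple random over-sampling, used here as a stand-in; the
    paper's own sampling scheme may differ.
    """
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y != 1]
    minority, majority = (pos, neg) if len(pos) < len(neg) else (neg, pos)
    if not minority:
        return list(features), list(labels)
    extra = [rng.choice(minority)
             for _ in range(len(majority) - len(minority))]
    idx = list(range(len(labels))) + extra
    return [features[i] for i in idx], [labels[i] for i in idx]


# Toy example: 3 positions, 2 base algorithms (the paper uses 12).
scores = [[0.1, 0.9], [0.2, 0.8], [0.3, 0.7]]
feats = window_features(scores, half_width=1)
labels = [0, 0, 1]  # 1 marks a binding-site position
balanced_x, balanced_y = oversample_minority(feats, labels)
```

The balanced vectors could then be passed to any of the meta classifiers named in the abstract, such as a support vector machine or AdaBoost.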